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Commissioner For Patents 
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Alexandria, VA 22313-1450 

Dear Sir: 

I, Glauco P. Tocchini-Valentini, on oath say and declare that: 

1 . I am the same Glauco P. Tocchini-Valentini who is a named inventor of the above- 
identified patent application. Since 1998, 1 have been the Scientific Director of the A. Buzzati- 
Traverso Campus, Monterotondo (Italy, Rome). As the Director, I operate the following 
institutions: European Molecular Biology Laboratory (EMBL) outstation on mouse genetics; 
Intemational Centre for Genetic Engineering and Biotechnology (ICGEB) outstation on mouse 
genetics; European Mouse Mutant Archive (EMMA); CNR Institute for Cell Biology. I have 
studied RNA since 1962 and was a post-doctoral fellow with E. Peter Geiduschek, Francis Crick 
and Sydney Brenner. I have been a Professor of Molecular Genetics both at the University of 
Chicago and the University of Rome. A copy of my Curriculum Vitae is attached as Exhibit A. 

2. I reviewed the March 20, 2008 Office Action issued by the US Patent and Trademark 
Office. I understand that Claims 1 and 4-17 are rejected as obvious over Fabbri S, et al, 
"Conservation of substrate recognition mechanisms by tRNA splicing endonucleases," Science 
280:284-286 (1998) in view of Santoro S & Joyce G, "A general purpose RNA-cleaving DNA 
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Application No. 1 0/82 1 ,777 

Declaration for Response to Office Action 

Office Action dated: March 20, 2008 

enzyme," Proc. Natl. Acad. Sci. USA 94:4262-4266 (1997). I submit this Declaration to provide 
evidence explaining that neither Fabbri et al. nor the art provided the knowledge or motivation to 
a skilled person that eukaryal tRNA endonucleases could cleave trans-formed BHBs in non- 
tRNA target RNA molecules. 

3. Fabbri et al lists eight (8) authors that are as follows: Stefania Fabbri, Paolo 
Fruscoloni, Emanuela Bufardeci, Elisa Di Nocola Negri, Maria Baldi, Domenica Gandini 
Attardi, Emilio Mattoccia, Glauco P. Tocchini-Valentini. I was the principal author of Fabbri et 
al. and drafted a majority of the manuscript. 

4. In support of the obviousness rejection, the Examiner cited a paragraph on p. 285 of 
Fabbri et al, which she indicated as providing knowledge and motivation to the skilled person 
that eukaryal tRNA endonucleases could cleave trans-formed BHBs in a non-tRNA target RNA 
molecule. Because I wrote that paragraph, I respectfully disagree. 

5. At that time of Fabbri et al, we were not at all interested in producing trans-formed 
BHBs. The paragraph on p. 285 simply referred to the exciting possibility that in nature there 
could exist different cis-formed BHB-like motifs that could function as substrates (other RNA 
substrates in my sentence) for some tRNA splicing endonucleases. 

6. In addition, that paragraph addressed what is depicted in FIGS. 1-2 of Fabbri et al 
FIG. 1 showed cleavage of pre-tRNAs having the following structures: (1) a 5 -terminal 
phosphate group; (2) an acceptor stem comprising a seven base pair stem made by the base 
pairing of the 5'-terminal nucleotides with the 3'-terminal nucleotides; (3) a CCA tail at the 3' 
end; (4) a D loop comprising a four base pair stem ending in a loop; (5) an anticodon loop 
comprising a five base pair stem whose loop contains the anticodon; and (6) a T loop comprising 
a five base pair stem. The pre-tRNAs were cis-formed and structurally distinct from those 
shown in the application by having the structures noted above. 

7. FIG. 2 showed cis-formed mini-substrates, which are also distinct from those shown 
in the application by having a 6 nucleotide loop that is not present in the structures shown in the 
application). In RNA hairpins, like the cis-formed mini-substrates shown in Fabbri et al, the 
sequence of the loop is essential for the formation and stability of the structure. The fact that cis- 
formed mini-BHBs were cleaved by eukaryal endonucleases does not immediately implicate that 
the skilled person could form an active trans-formed structure and predict that an eukaryal 
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ptRce A»lon dated- Man* 20, 2008 

endonucfease would subsequently eleaye the trajis-formed stractee. In fact, xt was aot imtil 
2001 that my group started to consider trans-formed structures and non-tRNA molecules as 
substrates for eukaryal eiidoBucieases (sm, e.g., Fruseoloni P, et at, EMBO Report 2:21 7-221 

(2001); s^chei as Exliibit B). 

6. Fyitoro<»e, we fomd in subssqueat years that even more felaxsd fon»S of BHB are 

cieaved by 011^5 like the em derived torn 

G, er a^.FNAS, 102:8933-8938 (2(®5)jToc«-V^edentim <3, eti^l., PNAS, 102:15418-15422 
(2005); imdTp^ni-Valen^ G, et aL fHM, I04a2300-m05 (2007)5 *11 attached as Exhibit 

7. AS for Santoro & Joyce, I ficd iiat tiie Escamia^ wMterteiitly mischaractenzed ^ 
it ieacliss to the skilled parsou. Santoro & Joyce Mmm&d catalytic aueleotidestoowtx as 
DNAaymes, wHch recognize substrates based vapm Watsoft-Cndc base posing; &ey do not 
require protein enzy^^. In contrast, tFNA mdonudeases do not use W^tt-Crick base paifii^ 
for cieavage. Because Santoro & Joyce relate to DNAzymes, it cattBOtreaiaitt relevattt to &e 
obviousness of &e invention as a skilled person could not apply the teachinp of Santoro & 
Joyce to Fabbri et al and have achieved the invention. 

8. AcconJing to 35 U.S.C. § 25 and 37 CF.R. § 1 .68, we hereby declare that all 
stet^ents made h^dji Of ow own ic!K>wledge aie true. 

infoWMtioti aiBS bdBcf are believed to be true; md further, ti»t these statements are made with 
knowledge ihat wiilM felse stat^mmts, and Hie UJse so made, ars puaishabie by fine or 
imptisonm^at, or boih. wier Section 1001, im 18 eftbe United States Cod^ and &at suoh 
wiU&l false mcme&is my je^MSze the vaU#^ of &e «#icafion or any patsntiss»|ng 

thereoii. 
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CURRICULUM VITAE 



GLAUCO TOCCHINI-VALENTINI 

Place of Birth: Piandimeleto (Pesaro), Italy 
Date of Birth: 30 October 1937 

Address: Via Cassia n.240, Rome, Italy 

Married with 3 children. 



University Studies Doctoral Degree in Pharmacy, University of Rome, 

1959. 

Doctoral Thesis: A Method for the Determination of 
Riboflavines 



Post-doctoral Experience 

1958-59 



Institute of Biochemistry of the University of Rome (Italy) with Prof. 
A. Rossi-Fanelli 



1960 EURATOM fellow at the "Institute ftir Strahlenbiologie", Karlsruhe 

(Germany) with Prof. K.G. Zimmer 

1962-64 Post-Doctoral Fellow in the Department of Biophysics of the 
University of Chicago (USA) with Prof. E.P. Geiduschek and Prof. R. 
Haselkorn 

1965-68 Post-Doctoral Fellow in the Division of Molecular Genetics, MRC 
Laboratory of Molecular Biology, Cambridge (UK) with S. Brenner 
and F.H.C. Crick 



Periods Abroad 

1975 Visiting Professor in the Department of Biology, UCSD, La Jolla, Califomia 
(USA) 



1980- Various stages from assistant to associate and then full professor (autumn 
quarter) in the Department of Molecular Genetics and Cell Biology of the 
University of Chicago, Illinois (USA) 

Prizes and Awards 

1996 XXIV Scanno Prize - Province of U Aquila (Italy) 

1997 "Gaetano Quagliariello" Plaque 

1997 High Officer of the Order of Merit of the Italian Republic 
1999 HFSP - 10* Anniversary Award 
2002 Tartuf ari Prize - Accademia dei Lincei 

Affiliations 

- American Academy of Arts and Sciences - elected foreign member in 1994 

- European Molecular Biology Organisation (EMBO) - elected member in 1969 

- European Science and Technology Assembly (EST A)- member 1994-98 

- Human Frontier Science Program Organisation (HFSPO) - One of the founders of 
the Programme 

- One of the founders of the Human Genome Organisation (HUGO). Member since 
1990 

- Academia Europaea - elected member in 2000 



Professional Activities 

In 1962, hired as a Researcher by the Italian National Research Council (CNR) and 
worked with Prof. Franco Graziosi 

From 1968 to 1971, worked at the Institute of Genetics and Biophysics (LIGB) in 
Naples, Italy, directed by Prof. A. Buzzati-Tra verso 

In 1971, transferred to the Institute of Cell Biology in Rome, Italy, directed by Prof. 
Rita Levi-Montalcini. Since 1971, in charge of the Mechanisms of Gene Expression 
division of the Institute of Cell Biology. 



Current Position 

Since 9 May 1980, Director of the Institute of Cell Biology in Rome, Italy 

Participation in Recent Conferences and Meetings 

1996 EMBO Workshop "Frontiers of Molecular Biology" (Rome) 

1998 Meeting "Comite Editorial des Compte-rendus de TAcademie des Sciences", 
(Paris) 

1999 EU Consortium RNA Structure and Function, (Rome) 

1999 The Mouse ENU Mutagenesis Workshop, Monterotondo (Italy) 

2000 Seventh Session ICGEB - Board of Governors (New Delhi) 

2000 A European Programme for Centres of Excellence in Life Science and 
Biomedicine (Brussels) 

2000 European Science beyond 2000 (Lisbon) 

2000 E-Biosci Meeting, Heidelberg (Germany) 

2000 EU Meeting "Design of RNA domain, substrates inhibitors of tRNA- 
recognising proteins", Munster (France) 

2000 "Molecular Genetics and Molecular Biology", Symposium in honour of 
Peter Geiduschek, Pacific Grove, California (USA) 

2000 HFSP 23*^^ Meeting of the Board and Trustees (Strasbourg) 

2000 HFSP 24th Meeting of the Board and Trustees (Strasbourg) 

2000 Fifth Annual Meeting of the RNA Society, Madison, Wisconsin (USA) 

2000 Europolis Workshop, "Scientific and Technological Institutions and the 
Knowledge-Based Economy in Europe" (Lisbon) 



2000 International Colloquium on Life Sciences, Ethics, Economy and Society, 
Bordeaux (France) 

2000 E-Biosci Governing Body Meeting, Grenoble (France) 

2001 HFSP 25* Meeting of the Board and Trustees (Strasbourg) 

2001 Mutagenesis, Cloning and Disease Models in the Mouse (Rome) 

2001 HFSP Awardees Meeting, Turin (Italy) 

2001 EMBC - Strategic Working Party and EMBO E-Biosci Committee 
(Heidelberg) 

2001 CNR-UNESCO Annual Meeting - National Institute for Infectious 
Diseases "Lazzaro Spallanzani" (Rome) 

2001 EMBO Building Grand Opening (Heidelberg) 

2001 EU EMMA Press Conference (Rome) 

2001 EMBL Finance Committee - E-Biosci Governing Body Meeting (Grenoble) 

2001 EEC mice consortium Meeting, 16* August (Paris) 

2001 EEC mice consortium meeting 23^^ September (Paris) 

2001 lAPSS Conference (Rome) 

2001 EMMA works consortium meeting (Monaco) 

2001 Workshop "The Biology of the Post-Genomic Era" Trieste, (Italy) 

2001 ICGEB Board of Governors, Trieste (Italy) 

2001 EMBC/EMBL Meetings (Heidelberg) 

2001 HFSP 26* Meeting of the Board of Trustees (Strasburgo) 

2001 Italian EMBO Members Meeting (Rome) 

2002 E-BioSci Meeting (Frankfurt Airport) 

2002 EMBC - Strategic Working Party (Heidelberg) 

2002 EMMA Meeting, Monterotondo (Rome) 



2002 EU-6 Framework - pre-meeting (Strasbourg) 

2002 HFSP - 27* Meeting of the Board and Trustees (Strasbourg) 

2002 EUMORPHIA- Consortium Meeting - Charles de Gaulle Airport (Parigi) 

2002 40* Anniversary of EMBO (Heidelberg) 

2002 EMBC - 4* Strategic Working Party (Heidelberg) 

2002 HFSP - Second Annual Awardees Meeting, Ottawa (Canada) 

2002 HFSP - 3«i Intergovernative Conference (Berlino) 

2002 Oriel/ E-Biosci Summer Meeting, Wieslock (Germany) 

2002 EMBC-EMBL Meetings (Heidelberg) 

2002 EC, EMMA, EBI - Meetings (Frankfurt Airport) 

2002 EMMA Meeting (Munich) 

2002 Extremophlles 2002, Naple (Italy) 

2002 European Research Council 2002 (Copenhagen) 

2002 EMBC - Strategic Working Party (Oslo) 

2002 EMBL-EMBC Meetings, Hamburg (Germany) 

2002 HFSP - 28* Meeting of the Board and Trustees (Strasbourg) 

2002 E-BioSci/ORIEL Joint Meeting (Oxford) 

2003 E-BioSci Electronic Information - Programme Committee Information 
(Heidelberg) 

2003 EUMORPHIA Assembly (Strasbourg) 

2003 IPPC-EMMA, Monterotondo (Rome) 

2003 EMBL Strategic Forward Look (London) 

2003 EMBC - Strategic Working Party (Paris) 

2003 Life Sciences in the European Research Council (Paris) 



2003 Future of Mouse Genomics in Europe (Frankfurt) 

2003 EMMA Meeting, Monterotondo (Rome) 

2003 HFSP 29™ Meeting of the Board of Trustees (Strasbourg) 

2003 V European Symposium of the Protein Society (Florence) 

2003 DNA: 50 years on the double helix (Cambridge) 

2003 EMBC - Meeting of the Working Party on Special Project on Research 
Grants/ Awards (Paris) 

2003 EMBL Strategic Forward Look (Paris Airport) 

2003 ELSE - Meeting on the European Research Council (Venice) 

2003 34* Ordinary Session of the EMBC, EMBL and Finance Com. (Heidelberg) 

2003 RNA 2003 - Eighth Annual Meeting of the RNA Society (Wien) 

2003 New Frontiers In Nucleic Acid Research (Berlin) 

2003 Mouse Functional Genomics in Europe (Bruxelles) 

2003 ORIEL Partners - E-BioSci/ Oriel Annual Workshop, Varenna (Italy) 

2003 EMBL, Strategic Forward Look (Zurlg) 

2003 EMMA Meeting (Harwell, UK) 

2003 EUMORPHIA - First Annual Meeting, Slough (UK) 

2003 EMBO Members Workshop, Killarney (Ireland) 

2003 Tenth Session of the Board of Governors of the ICGEB, Trieste (Italy) 

2003 EMBL Council, EMBL Finance Comm., EMBC Meetings, Hinxton (UK) 

2003 International Symposium RNA 2003, Kyoto (Japan) 

2003 21st Anniversary of EMBO Journal (Heidelberg) 

2003 HFSP Board of Trustees (Strasbourg) 

2003 EUMORPHIA WP 10 (Rome) 



2003 EUMORPHIA Management Meeting (Frankfurt Airport) 

2004 EMBC Extraordinary Session (Heidelberg) 
2004 EUMORPHIA WP Meetings (Rome) 

2004 EUMORPHIA General Assembly (Munich) 

2004 15th Anniversary of HFSP/HFSP Board of Trustess (Strasbourg) 

2004 EMMA Meeting, Monterotondo (Rome) 

2004 EMBC Strategic Working Party (Heidelberg) 

2004 EUMORPHIA WP 9/10 (Rome) 

2004 International Regulome Consortium (Ottawa) 

2004 HFSP - Fourth Annual Awardees Meeting, Hakone (Japan) 

2004 European Research Programme in Bioinf ormatics (Bruxelles) 

2004 EMBO - 40th Anniversary Meeting (Heidelberg) 

2004 EMBL Council and EMBC Meetings 

2004 "In vivo and Transgenic Mouse Models'' Conference (Berlin) 

2004 EMBO Council (Heidelberg) 

2004 EMBO - Sectorial Meeting Bioinf ormatics and Computational (Rome) 

2004 Mouse Mutagenesis Mouse, Heathrow (UK) 

2004 EUMORPHIA Annual Meeting, Heathrow (UK) 

2004 E-BioSci/ ORIEL Annual Workshop / E-BioSci Electronic Information 
Programme Committee Meeting, Hinxton (UK) 

2004 EMBC 35th Ordinary Session, EMBL Council and Finance Comm. 
(Heidelberg) 

2004 Mouse Resource Centers Roundtable, Bar Harbor, Maine (USA) 

2004 ICGEB - Board of Governors (New Delhi) 

2004 Working Group HFSP Journal - HFSP/BOT (Strasbourg) 



2004 EUMORPHIA WP 9, 10, 14 (Rome) 

2004 Conference: "Funding basic research in the life sciences: exploring 
opportunities for european s)mergies" (Bruxelles) 

2005 Research Infrastructure in Life Sciences Meeting (Paris) 
2005 EMMAinf 1^* Meeting, Monterotondo (Rome) 

2005 MUGEN Kick-off Meeting (Atene) 

2005 HFSP Journal Working Group (Strasbourg) 

2005 EUMORPHIA General Assembly (Madrid) 

2005 PRIME SAG (Scientific Advisory Group) (Madrid) 

2005 Focus Meeting on the use of animals in scientific research (Madrid) 

2005 HFSP - Board of Trustess and HFSP Journal Working group (Strasbourg) 

2005 EMBO CouncU (Strasbourg) 

2005 EMBC Strategic Working Party 

Workshop on EMBC/ EMBO and Improving Life Science Research in Europe 
(Berlin) 

2005 2"^* Mouse Resource Centers' Roundtable (Rome) 

2005 HFSP Fifth Awardees Annual Meeting - HFSP Journal Working Group - HFSP 
34* Board of Trustees (Bethesda, Maryland USA) 

2005 EUCOMM and PRIME Meeting (Frankfurt Airport) 

2005 EMMAinf, 2"^ Meeting (Stockholm) 

2005 First EUCOMM Meeting (Frankfurt Airport) 

2005 EMBC 36* Ordinary Session (Heidelberg) 

2005 Priorities for Mouse Functional Genomics in Europe - Promotion and 
application of the EMPReSS phenotyping platform (Frankfurt Airport) 

2005 MUGEN - 1st Technological Workshop on Transgenesis (Marsiglia) 

2005 EMBC Strategic Working Party - EMBC Workshop (London) 



2005 FIMRe (Federation of International Mouse Resourcers) Meeting (Washington) 

2005 Eumorphia - EuroMouse Conference 

PRIME Open Scientific Meeting (Venezia) 

2005 ICGEB Board of Governors (Trieste) 

2005 9*^ International Symposium on Parkinson's Disease 

Neuroscience 2005 - 35*^ Annual Meeting (Washington) 

2005 EMBC-EMBL Meetings (Heidelberg) 

2005 Eumorphia - WP 9 , 10 (Rome) 

2005 ESFRI - Expert Group on Genomics et al.- 1^* Meeting (Bruxelles) 

2005 2^^ EMM Ainf Meeting - Monterotondo (Rome) 

2005 Working Group for HFSP Journal (Strasbourg) 
HFSP - 35*h Board of Trustees (Strasbourg) 

2005 EMBL/EMBO/CNR Mini-Symposium on ''Genes and Behaviour" - 
Monterotondo (Rome) 

2006 ESFRI - Expert Group on Genomics et al. - 2^^ Meeting (Bruxelles) 
2006 EUMORPHIA - ?>^^ Annual Meeting (Barcellona) 

2006 EUCOMM - Kick Off Meeting (Munich) 

2006 ESFRI - Expert Group on Genomics et al. - 3^^ Meeting (Bruxelles) 

2006 EMBL Council (Heidelberg) 

2006 HFSP 36th Board of Trustees 

HFSP Publishing Board (Strasbourg) 

2006 EMMAinf Meeting (Harwell) 

2006 HFSP 37th Meeting of the Board of Trustees (Parigi) 

2006 EMBL Council (Heidelberg) 

2006 EuroPhenome Meeting: 1^* and 2^^^ Phenotyping (Monaco) 

2006 1^* International Mugen Conference ^'Animal Models for Human 
Immunological Disease" (Atene) 



2006 EBI-EMMA Group Meeting (Bruxelles) 

2006 Extraordinary EMBL Council and Finance Committee meetings 
(Heidelberg) 

2006 EMBL - ATC , Groundbreaking Ceremony (Heidelberg) 
2006 EUCOMM submeeting (Monaco) 

2006 EMBC, EMBL Council and Finance Committee Meetings (Grenoble) 
2006 ICGEB, Board of Governors (New Delhi) 
2006 HFSP - 38th Board of Trustees (Strasbourg) 

2006 5**^ EMMAinf Meeting - Monterotondo (Rome) 

2007 EUCOMM Annual Review (Monaco) 

2007 FP7 - First Meeting of the Programme Committee for the Theme 1 : Health 
(Bruxelles) 

2007 EUMODIC - Start up Meeting (Barcellona) 

2007 Workshop "The future of Research on Mouse Functional Genomics" 
(Bruxelles) 

Scientific Societies, Commissions and National and International Working 
Groups 

- European Molecular Biology Organisation (EMBO) - elected member in 1969 

- EMBO Council - member since 1981 

- EMBO Council - President, 1985-87 

- EMBO Journal - Editor in Chief, 1985-87 

- RNA, Editorial Board Member, 1995-96 

- European Science and Technology Assembly (EST A) - member 1994-98 

- Human Frontier Science Program Organisation (HFSPO) - He is one of the 
founders of the Programme. HFSPO Council - Vice President, 1990-93 



- Italian Society of Biophysics and Molecular Biology (SIBBM) - member 

- Italian Society of Microbiology and Microbial Biotechnology (SIMGBM) - 
member 

- Human Genome Organisation (HUGO) - member 1990- 

- Human Genome Organisation Council (HUGO) - Council Member, 1991-93 

- American Academy of Arts and Sciences - Elected Honorary Foreign Member, 
1994 

- Study Commission on the International Relations of the Italian National Research 
Council (CNR) -member 

- Inter-ministerial Commission for the Co-ordination and Application of 
Biotechnology of the Italian Ministry of Health - member, 1995 

- Scientific Committee of the "A. Buzzati-Tra verso" Foundation - member, 1995 

- Study Group for the Evaluation of the Activities of the Research Organs of the 
CNR - member, 1995 

- Institute of the Italian Encyclopedia "G. Treccani" - Member of the Advisory 
Board "Frontiers of Life" series, 1995- 

-Technical-Consulting Commission EUREKA Project, Italian Ministry for 
Universities and Scientific and Technological Research (MURST) - member, 1995- 
96 

- International Association for the Promotion of Co-operation with Scientists from 
the Independent States of the Former Soviet Union (INT AS) - expert, 1996 

- "Comptes rendus de TAcademie des Sciences" - member of the Editorial 
Committee - 1997- 

- CNR Study Commission on Cloning - member, 1997 

- CNR Study Commission on Bioethics - member 

- Human Frontiers Science Program (HFSP), Member of the Board of Trustees, 
1996 

- National Committee for Biosafety and Biotechology, Presidency of the Council of 
Ministers, Work Group for the Evaluation of Biological Risks - member 

- Italian National Commission UNESCO - member, 1998- 



- MURST Work Group Biotechnology Sector- member since 1999 

- Secretary General, European, Molecular Biology Conference (EMBC), 2000 

- Academia Europaea - Elected Member, 2000 

- CNR Representative at the Italian National Commission UNESCO - 2000 

- MURST Post-Genome National Commission - member since 2000 

- MURST National Commission for Neuroscience - member since 2000 

- CNR Member of the Coordination Committee, 2003 

- ESFRI (European Strategy Forum on Research Infrastructures) Member of 
the expert group, 2005-2007 

- Member of Italian National Expert Group, Health-Cooperation Programme 
Committee, European Union Framework Programme 7, 2007 

- ESFRI (European Strategy Forum on Research Infrastructures) Italian 
Delegate of the Ministry of University and Research, 2007 



ORGANISATIONAL AND MANAGERIAL EXPERIENCE 



Since 1980, G. Tocchini-Valentini has been Director of the CNR's Institute of Cell 
Biology in Rome. Since the onset, the Institute has been divided into Divisions and 
Services. It currently employs 52 persons in research activities (37 with Doctoral 
Degrees and 15 technicians) and 7 persons in administrative and secretarial work. 
Since 1998 he has co-ordinated the organisation of the new international scientific 
Campus "Adriano Buzzati-Traverso" at Monterotondo (Rome, Italy), which was 
created thanks to a CNR project that has involved also the most important 
European scientific organisations. The Monterotondo Campus was created by the 
CNR for the purpose of developing and internationalising Italian biological and 
biomedical research, and is named after Prof. A. Buzzati-Traverso, the scientist 
who, while working for the CNR, brought modern molecular biology to Italy. The 
CNR transferred the Institute of Cell Biology (IBC) from Rome to the 
Monterotondo Campus, where research teams work in close collaboration with the 
international institutions present on the Campus: the EMBL (European Molecular 
Biology Laboratory) and EMMA (European Mouse Mutant Archive). 

In Italy, he has been the Advisor for Biology of two Ministers for Scientific and 
Technological Research and has covered numerous organisational functions 
within the CNR. He has been Director of both Subprojects and of ''Finalised" 
Projects. 

In Europe, he collaborated in the development of framework programmes and has 

been a member of ESTA. In particular, he contributed to the development of the 
concept of infrastructures in Life Sciences. He has been a member of the EMBO 
Council, of the EMBL's Scientific Advisory Committee and is currently general 
secretary of the EMBC (European Molecular Biology Conference) and a member of 
the EMBL Council. 

He is one of the founders of HUGO and has been a member of the organisation's 
Council. 

He is one of the founders of HFSPO and has been a member of that organisation's 
Scientific Council. Currently he is a member of its Board of Trustees. 
He has participated in several G7 meetings for Bioethics and was president of the 
"Human Genome" meeting. 

He has taught at both undergraduate and graduate levels at the University of 
Rome and at the University of Chicago. 

He has organised numerous courses and conferences on behalf of EMBO, UNIDO 
and HFSPO, and has given numerous seminars in Europe, the United States and 
Japan. 



Positions of Responsibility in Foreign Research Projects (recent) 



1996-1999 Holder and co-ordinator of the European Framework Programme 4 
contract "EMMA-European Mouse Mutant" 

1996-1999 Holder and co-ordinator of the European Framework Programme 4 
contract "EMMA-European Mouse Mutant Archive" 

1998-2000 Holder and co-ordinator of the European Framework Programme 4 
contract "EMMA-Resource Database" 

1998-2000 Co-holder of the European Framework Programme 4 contract 
"Design of RNA domains. Substrates or Inhibitors of tRNA 
Recognising proteins" 

2001-2005 Co-holder of the European Framework Programme 5 contract 

"ORIEL - an Online Research Information Environment for the Life 
sciences» 

2001-2004 Holder and co-ordinator of the European Framework Programme 5 
contract "EMMAworks" 

2001- 2004 Holder and co-ordinator of the European Framework Programme 5 

contract "EMMAnet 

2002- 2005 Co-holder of the European Framework Programme 5 contract 

"EUMORPHIA» 

2004- 2008 Holder and co-ordinator of the European Framework Programme 6 

contract "EMMAinf " 

2005- 2009 Co-holder of the European Framework Programme 6 contract 

"MUGEN» 

2005-2008 Co-holder of the European Framework Programme 6 contract 
"EURASNET» 

2005- 2008 Co-holder of the European Framework Programme 6 contract 

"EUCOMM» 

2006- 2008 Co-holder of the European Framework Programme 6 contract 

"EUMODIC» 

2006-2008 Co-holder of the European Framework Programme 6 contract 
"CASIMIR» 



Scientific Co-ordination Activities 



1981-1985 Co-ordinator of Subproject 1 of the CNR's Finalised Project "Genetic 
Engineering and the Molecular Bases of Hereditary Diseases" 

Member of the CNR's National Consulting Committee on 
Biotechnology and Molecular Biology 

1995 Study Commission of the Finalised Project "Applications of 

Molecular Genetics to Human Health" 



1997 Co-ordinator assigned by Eurohorcs of the "Survey of the strengths 

and weaknesses of European science" for the biochemistry, 
molecular and cellular biology sectors 

1997- Member of the Scientific Committee of the President of the CNR 



1999 Co-ordinator of the CNR-MURST Strategic Project "Molecular 

Genetics" (Law 449/97) 

1998 Co-ordinator of the CNR Strategic Project "Basic Technologies of 

Post-genomics " 

1999- External member of the Scientific Council of the CNR Institute of 
Biochemical and Evolutionary Genetics at Pavia, Italy 

2000- Co-ordinator of the CNR-MURST Project "Functional Genomics" 
(Law 449/97) 

2000- Member of the Co-ordinating Unit of the MURST-CNR Sector 

Programme "Biomolecules for Human Health" (Law 95/95) 

2001 Co-ordinator of the CNR Study Commission for the MURST 
Strategic Programme "Post Genome" 

2002 Co-ordinator of the Italian Basic Research Fund (FIRB-MIUR) 
network project: "Mouse mutants, Phenotypes and Human Diseases 
- Mouse PHD" 



2005 Co-ordinator of the Italian Basic Research Fund (FIRB-MIUR) 

International Project: "CNR-IBC/G. Armenise Harvard Medical 
School 



DIDACTIC AND TRAINING ACTIVITIES 



1977 Professor of General Genetics at the University of Rome "La 

Sapienza", Rome, Italy 

1980- Professor of Developmental Genetics in the Department of Molecular 

Genetics and Cell Biology of the University of Chicago, Illinois, USA 



National and International Courses: Organisation and Teaching 

EMBO Course on Genetic and Molecular Analysis of Development, Rome, 
October 8-23, 1979 

EMBO Workshop on Eukaryotic RNA Synthesis in vitro, Rome, October 1-3, 
1981 

EMBO-NSF Workshop on Eukaryotic RNA Synthesis in vitro, Rome, September 
25-27, 1983 

- EMBO-NSF Workshop on Eukaryotic RNA Processing, Rome, May 22-26, 1985 
Workshop on RNA-Protein Interactions, Urbino, June 29-July 1, 1987 

- Workshop on Molecular Genetics of Yeast, March 20-31,1989, ICGEB, Trieste 
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Cleavage of non-tRNA substrates by eukaryal 
tRNA splicmg endonucleases 
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rykaryal lJ?NAjiplicjng endonucle^ises; use the mature domains 
of pre-tRNAs as ihmr prmiary recognif>DJ> eiements. Howevt?r» 
ihey can also t'teavtj m a modi? that b ind^p^od*?m of the 
mature domain, when substrates are able to form the bulge- 
hcslix-bulge structure (BHB), whkh h cleaved by archaeal tRNA 
emclonudeasesv We preiient tjvid^na^ l\\<xi Jhe eukiiryal 
enxymes cleave tfiek ^^ubi^trales^ aittjr f(jrfnu)g d structure that 
fes>emble$ the BHB. Consequefitly^ the^^e ^^nsym^^s can c^eave; 
substrates that lack the m^itare domain altogether* That raises 
the possibHity that ihm^ enzy'nu*s coutd also (iltjave non-tRNA 
substrates thai already have a BHB. As predicted, they can do 
st>, both in vitro md in vivo. 

INTRODUCTION 

Accuracy sf^ iRNA sf);lk:5ng <?ssf?nti:5l for 5b€? ib:rniisl:tan of fus^c- 
tRMAs*. aod hence fo!" ceH viabsSity. U\ hoik Afch^ea and 
f^uk^irya ihe speclrktty of spiictng resides m recogrsJtior^ ot tRNA 
pr^icijrsors by JRNA splicing en(ioBuc:l?^3se?> {Belfort md W«;lmif. 
1 997; TfoJ'ij'ai^d Abelscjf^ 1999}. A^■^:ha^:<il: tRNA f^phcing ends>- 
nitcbs^^^s cla<5ve pf'^f-tRNA^; only ussng an RNA stfiicture 
cortipfiji^rd ol two bx.il^'i^s of thr^j:^ f^5.jcSeoii:d*?s ^>^ch (vvhet'e 
cleav-jge occurs; ^^^par;ii^ed by iour b■J*i)^ paff$. Thi^ sjr^jcture, 
calkd ihe [)u[ge--hetk--bulge (BHB) (Pigi^re t A, 2} (r>n:nseis et^iL, 
1 9B5; Dinner md Moofe, 1996}, functiors^ indqxifidendy of \he 
pm of th0 molecules th^it CDn?;t5tiite?^ the malure (RNA, g-o we 
rf?fe:t' to this ?vpff of rf^cognitio>n: of the clf?;5V^tgf? iters as being the 
ma5U^^^-domam ijvck^p^indtrni; fv^ockr. In conljariJ, ei^karyal tRNA 
splicing cndonuc:^^^!^eii^ lequsrc ^nie^-actian wish tfic maaffc 
tRNA don^.^in for onentetion, s^o we refer to tiiBt reeognibon as 
Jh«? mAtL5re?Klom<=i5n dep^ndeftt tnode (MattDccia et air 19^8^ 
Reyefi and Abf^kon, \ 9H'^). 

Recognition f>T pre-tRNAs by eukaryal tRNA spliicfng endo- 
fHj<:Je;jr>es nc^rnfislly rt^q^nies Ihe frjatuff? tRNA dofnaui, as wej:^ 



which !$ k^rn^ied b&tvv^en siuclef^tsde^ m the imilcodon \oop iHMl 
fhe (ntror-j (Bakii ei* a/., 1992/. The A-i pair iTsust be at a i'mKi 
dbVtim:^. frorn the nm^re dofnaJn for cleavagt? to occur arjd 
cleavage near thl!^ base pajr generates the 3' spfice site. An inde- 
r)endent ci<?avctg<? evwt also ctt a tlx^^d distance imm the mature 
domain {L^su^iHy at a purine), gen era leer fhe F/ hplk.e f>i^:e. 

The Jvvo mode^ ^nb^irate fi?cognUion are characterised by 
two distances, in ^he ma^ur^-dornjjin IndepemJersj mxm the 
heij.x o( \he BHB ihe distance betw«^:n tf^etvvo bulges; m the. 
maturf^domain dcfpendent mode the distance is fmid relative^ to 
reference? In Jhe m<jfijre domain- 

Whih Jbe subuntt structures of the eukaiyal and arch."iea{ 
enzymes ditfer signifsctnttsv {TroU<3 Abebon, 1999), a^ do 
the superficial s^mttures of cieavjjge we 1^,5 ve deinon- 
&tfat5?d thai both t\w Xenapm and yeast tRNA spficing mdo- 
nuciease&can operiUe sn the fnalyre-donia^n independent mode, 
cbafjicsertstic of Archae.«3 {VMy^i et bL, \99B). The? resulrs 
reported fn fhi^; papes" exfjlaiF^ why the eukaryal endc3nu«:ieas«ts 
retain the abilify opera ie jn the fnatuf^^domajn independent 
mode svhen their natural substrates do not have a BHB, 

RESULTS AND DISCUSSION 

The artificial sisbiy-raic pre-tRNA^'--'-^---'^= contih^s both ti jrsaJure 
dopnain and is BbIS {Figs^re 1 A, 2). The eukaryijl enzymes dmyv. 
ihe subsitrate with a ts\<o ba^e-pair ifBeri in tbe antlcodon stem, 
2bpVas (rij^ui'e 1 A, 4;;,, onSv '^"^ the niatiire-doiYsiiin independent 
m()de (Pabbrf el ^L, i 998j. The siU^s oi cleavage by the eukaryal 
cfj^yn^e 3 re n>;ed by recognition kx:al BHB skuctur^i raSher 
\hnn by reierence tc? the mature don),5sn, 

Thi) X0nopm Uuwk endontjckiase cm ako cleave? in vim In 
the makire-donv^iii indeperH^ent mode. When prc^-tRNA^-^'^'-'-"''^^ 
and pre-tRNA^ "^'^'^^^ 2bp^is were injected snto Xawpus oocyte 
nucleh both sub^>5raier> were s^piiced and f ixated. Tf^esize oftfie 
maU;ni! tKNA^'^'-^-^^^^ IbpVaj^, which is kmr bases kjnger thijrs 
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§1 



Si 



1^1 



^tj. J. (A) The ijiteraciiou, A toa^;i:TVi;t1 pwri^K \x%\'Xw m ih^ \mm\ o:ucic^?l34c:;^ fr^:fj5i the 3' vka^^'it^e sttc K R s?? 5x^;s> tisuji paif a. 

pydnikiinc i« i^K^ sa{i^'c>ki;">i"t :;x>p 6 ^^ucktjtidt:^ ujJ5^f>:'^tm of dsc 5' ck^E^Ya;^t^ ^.itc (ffiokx^uk^ I. Y m box) form the A-l (Rii" atiiki^di^-ii-iii^rcii) intcvaciicii (Baidi ei 
<{L^ BHB (:rw.a5.eci53o 2). 'tV'O bu^^^ft of \imi> \v,Kl<:<Miki> ^mh (wtioro ckaviigs; i^ccwv?) ragkJJy A^^p<jrtiJc<1 by (oxir b-^m p<iivst vDa«.ieJ:!v <-i'o.L, 19^5', Dlmi^ ai^d 
Mvw:^'^, BHiArmfl^^^'^ik- '}). A sl^^i^e-fiisetoiiJi;. -r scifc bdi^r;, alow: i>y^^^;>yj:f hvriix ii bo5>contii!.w.ng ^iic 5' :f:*t,^-lRNA''^-^^'^^^^ und lih vmkinU {nv^^k^wh 4), 

<Jv.>j»tti« wkh ck'aviigi? vjtcs ioctt?i'rf ?rt tk' pr*?sci jkxi ;1is^345t:e ffow* the reference? dt'«jcnts and w t-yrrcv Uy-posiiknit'tJ A-i paif, i^ll v^f which ^hywld 
oiisiifL' tuwtfti re*;ogiiitJs:>fs by the tHiSsisyyyi s5fnioftwckMiAt^ wha-fi the oiwvfsjc operates in t}jt< rffaUu»fi«f«<!3« de^^t^wisjn? fsmk. hi addition, iiie prc^te/fce of the BJfB 
mo»t' cmfarsi niiMr^siit ohai-acrefistks thiii ai-e wcog;.iwribti^ by tht* 4;tikar>'SLl. sw^syftt^i whfi« it ojwi'iitfts m. ihe wait ttrs -clowidw i«£lep«flt1i^«f mfMiiJ, (B) A Siibsu^ie 
cilcavefS 555 boih the mattiv*;-i1i->f!fjiit« tiiipsftiient ifee u>st«r$ dotxmn wdeper3*ie«t l*?v^d«CTS ot Mig'sstion by th*> Km<-:fm tHNA si>Uci»g endowucteasc. 



ny^iuw wild-type tRNA''*^'> indicates that the Xenopus- enzyme 
deiwe^ in the mature-domatn hidepf?j>d*?nt mcxie? in wo\mi m 

it (ifx^s f« Vitro (Fs^ure 5 A). The $ubst;:tnJbl atru^unfe oi mch 
pmcuri^Oi €c<pott£d bi:\om dmvrif^: probably tesuM iam satijra- 
t>c>fi of nuclear fet<5r3t:ion (Arl<r et i^L, 1 998; Lujxi m<{ Dablim'^ 

Some^ substrata?* am clawed in both ftx)d^s. Pr^stRNA^'^'--^^-' 
IbpAi-ts (Psgiirtj f B), whicK h*"i^ a thif?e bA^<?ypAif cid£^t30rt m the 
;jj>nc<:Kknj $t«tfn, i.s ciJenvecl m both sTJocles. \n this c<i*-a ^he iwo 

nwde?^ ysekl distincri praclucri sf^^s, <incl bolts ^Ke c>b^^?rveil Tv«.'0 
tnirQ!'j?> am vi?>ibb in pjguf^^ 1 h^ne 1 , One of tht prmkiclr^ Is not 
prodiicml in the C56G mis^nt r^frkcttr^g \he inability of ihe 
en^yt-iif^ to c^^^.av^f a Rib^t^ValjfS thsit cannot io^m rtormaJ n^akt«? 
(iomain in the mature-doniasn dependent nijxie 

We na\v propo-^e ^h;5t the orsenuitson the sub^^tr.?:te in the 
iicih-e $ite of the euk^uy<i{ enzyn^^^ }-equ{r^i$ the brni^tisoft of ;i 



structure that resembles a BHB; the A-j pan would play a pivotal 
mk? in this pmcessi, a$ it represenls tb? clo^sng base p^sir of mr^ 

d the bulgf?^^. This nmM pnA.t^ that recognition the niature 
tRNA dom:ain by a eijkaryal tRNA $p! icing endonuckiasct; ,iHow$ 
sttb$equ{^firtof'm;itfor5 ^>t ^3 Bf-iB-ifkc^ oleivage ^tatctore. 

In addition to tha A l pm (Baldi elai, 1 9921 CFther irelk;^ of she! 
afchasijl worid provide? tmighl Into ihe m^hanhm of the 
<;ukiir>'<i3 clefisvage; feiActlon, Sotne «?ukafyisj fjre?-tRNAs pft'ssent 
moWh that resemble the BHB, The s^^iijuence of ihe C.^emrh<ti> 
cfitis eleg^ns genome shows th^it tRNA genes corresponding to 
jht'f^e isqacceptor sipecfes^ (Leu, Tyt', lie) con^^it^ introns (The 
Ce/ei^<vN^j,*j Sf?qt3(?n<:ing Consoii:uin"^, U)98), The nern^dode pre- 
jRNAs present a mc?tii' which vve aili BHl ^ Figure 1 A, 3), which 
msembies ihir 8MB In ihiW \t has !he site buSge umi the bur 
base-pair helix but the S' ^^iie h se a kjop r[3ther fhaf^ h"i bul}^?. 
]1x^s.e three intton-contatning pr^^4RNA!5^ of C ehgcii}$ are 
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injm^l p.tH:ii?.rv pre -fRN A jrLiis.script:s dM ofxhe nmat'i tRNAs wea^ rfewniiiseii hj- asaiy-ils ^?^ ^n^^ ijyci$;tri'N) ;«jacyiH>pk=i«fn?<r <C) RNAs. T.i^^? uiji*et$<J p*?fuf.Kor 
RNAs ill the CYiv^piiiAu^ pfKibi:b:y fv?s«Ut\J ixmi kwfTKVwfft ijui;k;w re}enij4:>¥i. inpvi. (B) Ilia ya^vii srfsrf4->nwcl<fast^ uuu»jjs. cim«& pi-e4f^NA*''**'"'^-' i^t both 
^im^ Mi->\<i<A I , pK^-iKNAf aioisx'tjfc 2. pn>mNA.*^'«-^^*; K*f>Jscwie X pfevt^iNA'-'^'***^'^ 3J>|>VaHn, 080, G24C. X, XV«<>i>w v hm^h: X yeast ^f. m¥i'f.«*«<' wM 
type*.'. Y* yesj^t 5. treivmiiie fiett2-'.?. Tfts s^ft2':^ pfeparati-rsft wa$ a>mami«iiie<J wiih n :V t^KO»Lioii3fjUtjc: aeiiviiy ilisi p;jmdjlv <.1egf;Jfli!d th^ 3' fjf ihe pfe<:»<'5iov, 
3>^c:k3<:«5g t^'se si^e csf ifte y hi\\f predict. "Hie serjttfftice?; t>f iht^ pvLidiit-ii; ih^: i^iit^'i^ix e:^c:sk>^ hay& ten ^^e^■i^^^rd by lis^§erpvjrt?i^)^ (dmri rtf?i i:,h<>wn 1ft she 

j?ej?2-<> m«ra«i. cm mvixtc in Iw^p 1.7. OJy^^^ diiitiged u> CM {Tnym mi Ah'^]m\, 1999 ^ Loop ? ctMiS5r?s a ttt^tUHnc rcsjilu-^ that Is ahsclateiy cf disserved in all 
tRNA iffKk>«^5c5e;^se^,. find thai prohfshSy fto? s gii^^iirraJ ^■^i^iff hy ikpfim"iHiiii% ihii mii:ico\)in\c 2'"-[tydroxy3 ^nmj} (Tmi^imd AbcHi-m, V^W). The tusldivifs Oil 

(■.tola ;3nd A!>eh^>ft. 1999), (h) P^^-tKNA-^"^'^^'^'^ AfchiieA^ en?:ymt^ riVoji.s ai^d Aix:i?it>n^ '3999). (c) 'BHB Etij^ftrvfti t>o.^ym^?. A psr^posiiS U)r ,[;;>s^ s^ymmi^try dstrij^g 
volution of ihiJ Ij^tfv^n ^^Kcissaij re[iCt:ioa. Arc;ha<?i4v ihi-i n-^-ognmon cferrsi^at in pm-^RHA l< ^hi^ BHB^ which p?ftJi:<;!0-iss^o-fv)fvf :iy3:ameir^' 0>fi^f^i:r iift^j Mmm. 
199$: Tfv?u;i '^nd Xo<:.hm^ 1999\ SiiKt the ctKly>iwckai>e ;kx--'^ m^i bhal \hh^ ffiiiUtre dosfi^iij^ <>f pf\:-iRNA> xIk <^nzyn\c h mmii>^ii m ^^<h a. way ih^ bi^ih isttK-'c 
can ckavx^ either of vh^ imron-^^i^js j^!.v<;OvV}?<, fj?).. The prif«ary ft'oo^sjftittoft oicnieiU of the e«ki»iyiiJ. eisiloftuckiitie, o« fhe oih<n' hmd, is isk AS-yiis.wi'SUk-alJ.y 
;fuyj5i;{i ol f?f<^-tRNA; iiilor^cibu wiih i\m ik^mrim impi^i^h m orioniiiiiow af fhe e^jzymo on ihe :'jiibsJ,raie,| ho k\c\m:- sm^ is H:- ofsc 

■orihe Oiim imroix-<^.xQii j»nt:oo5^$. h <ibmrcit- en ii raaiu;^ clojmfrt, as mih ihe wwi-BHi^ (f"a:hhri i't<i(,. i99T\ ihe ir-ii'karyal ejjzymo: is fre<? tc^ rt;cog:!;i^<; ps<?wi:3!:^ 
iWi^^'oki sy?ji?ji^!5:rk: ij]tJ0^eni:>: ?n ih<-i mb<inii<i. so tJ^^t h'^nii iii (^ii^^y:"jie c^aj^ bifiiJ to thither J^ioeuo:": (c), Hovvevijr, whej^ ^ Nub:>Ei^J^ has rrsi^tiife: 

iHf iikii^y ti><>riewllhtt eH?.yme (a). We pii^poss? thst vkHi«gtn«isjt.i\^iV: imce yff;s:h>n«eSea.se* we^; j^tjffjto retx>^!.!.l5fijf^.e ;iia?:iif<^ tififfSAin, the tbf a syjiJiiMfffk- BHB 
fi?ce'i>i«iir>n ske <jijJiwU¥heii; k>v;ev-er. ihe acJhe siieA of the ey^;;jf y fji eiii/yi^ie?> vve;^> ;«3JjJSi*irt*it'!. aUrjwiug. to ch^ave pft^-foifrted BHB sstii^ciufti^. Becft«s<5 Jfje 
odiiaiauou of the xmy de;jV3^.e siti*^: in thii sn?;yw>& pewaiRed cowsiatai. the ewkaryaJ pye-tRNAs Jiad to isiams^sir! tfci« Miity to fortti a BWB-Jike .stf«ct«fe wpoj3 
bsrjiijrjg iha otik;3ryaj ers5^yme: pm oi'that riJtjutrL'tnt^tt^ is seew m ifti? A-f base py?rxxigniie and jw j^l* Bi(L. 
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cleave;d cort-ectly by both yeast cjnd Xenopm endonuc/leases, 
we^ m the Parmcms e<iuorum^NA splicing en do nii cleave, but 
ViOt by \hii archm^il imzyme- idiit^ not 5^hown). Thus, the ody 
imly u?i!ivc?5^3[ sisbs^rv^te h m RNA wixh a BHB (J^ibbs i et 

Because they can boJh ctesJve the BHB. *e ^rchaesi and 
euk^iry^i] endanufrleases are Hkely t^) h^ive fdentic^sJ dfeposittionii 
oi active sftf^s, a feature conjierved ssnce their dsve^'gtjncB from a. 
coj^trnofi ancestor' (Tfmn and Abe^ii^(Jn, 1999; Fabbd 
^9$B) i'Flgwfe 2Q. We suggest that the matufe'domasn 
dependent moik. arose through specianztitlon of th«^ subunits of 

We p>^o|x>sft th^J the f?ukai-yal e3i)?yniffS posse.?^ ^ mmxfe-^ 
dotmm ikpemimt F/ ake res\nu:timmi ^K:Efv?ty {D:i Nicola el 
1907). Such an actsvljy would be i'Ov the ia^^^ s^ieps of 

substrate fecogmtjon by Jh^^ eukarvj^l eir^cytf^e^, recapikilatrng the 
rec:c)gr5iiion pfoceJSiS ot their atcha*?ijl ajunjerpisrts. The 5' site 
rf?strLJCkjdng ;ictivhy h rtot nm^ed to deavft the Bi \B bec^u^e it 
alre^ady has a <::c>rrecl^y structured S' uiiK ht>wevt?r, the ac:t!vity is 
fesponssbie improvfng the efttcsency of ck^avage the 5' sifc^^ 
tn B^iL (P. Pru^cofonl M. Zamti^D^Ti. ML Balds aiid CP. Tocchtni- 
Vaienti?>t rp.ani(:$cr^pt in p rep^j j'ati on The Ajicaris enzyme aho 
has <t f?^attJre-dort';afn depfjndfjnt 5* restructuririg actsvsty, but: it 
dsffer$ from Jhat of Xmopu^ be<:au:^e js unable to resirucixne a 
Jypkral eiikafvaJ pre-iKi^A stjch as yeast pifMRNA^"^* (P. Fasscobni, 
M. Zambanl MJ, Baldi and CP, T<x:ehlnhVak;m5m m;xnus(:f{pt: 
in :f>ref)ara5k>B). 

Our mode.l pn^Uc^ the eixBtence oi' rn^ilanfs of ihe, eukDryiii 
enxvfTse that lack the 5' restrucatdng -K'Jivjty. Such muii^nls 
would be unab?e to cleave a eukafval f^re-tRNA at the 5' Site, but 
could cleave at the 3' ske. Mori? impoftaf^tly, the^ie r<?sfnfctuHng 
n-^ijtanf'i should clmm precursrOB th.-Jt already have ;5 BfdB, 

The yeast endoeuclease is* ari upyb heterotetramer (Ts'Otta el 
1997). MojTiok)gy re^laf ton^.hfpf^ and other evidefKe su;^ge=^t 
that two ^.ubunit'i. ^>t the enzyme, Ses}2p and- Sen!>4p, cxji^tassi 
distinct active one for the 5' site, ti^e other tar the: y site. 
The liT^utant sim2-3 h thktcWvn \n cleavage of the S' site (Ho et 
nL, \^90)} f-jgiire 23 shows th;5t sejt:? '3 exJrac*^ cleAVfjis the Y 
Ixii not the 5' <ilt(? of yeast pre- tRNA^'^"-. The san^e extftKt, 
hovvevef, cleiwes. pf'^^^^RNA^'^'^^"^'^^ at both ^'Figure 2 By Thus. 
$en2-3 cleaves the 3"" but iniot tbe sites in jiub^^trates lacking a 
BHB, a's would be ffxpected li it bcked the !?Tatu:re-do:!^^a:ju 
depe?Kbnt 5' sste rei^truciurlng ^5c:tlvity. Tht<. co«clu<«{on js- eehi-- 
htced by the fact that pfe^RNA*^^-^^^^^^-^ 3bp VBHB, a substrate 
iha? can intes-^ict with the en^yj-ne only In tt^e j-nature-domain 
df?per^dent mode. Is cleaved only at the 3^ site (Figure 

It is urj^kel v' that the lack ot cle;:ivagee a?, the S' 5>fie o^ p^-tf-lRNA'^^^^ 
resiilt^ ssoi^pSy mm [nac;jfvahon of the catalytic <ite sin^re the 
mulcted iJmsno iK:id fK>t near this sKe, babied <.m the crystal struc- 
fure <3f the enxyme (Li ei nL T996;. Moj^ovec pre-tRNA'^''""-''K 
which ha?; a BHB, h cleaved at both $ite$ ever? though Us imkm 
domain <ihotjkl preveFisl biudiuv^ ot the sen 3 4 aciivf? ^ile ^.o ^.he y 
cite (rigufisi ;^B;. Untortona^dy, f^xpfes5^^on ot a <cn2-3 :md my}^ 
dooble j-riutan? eruytTse i$ very likely to be lethal, rj^aking pfoduc- 
lion oi a doubly aiittated enzyme impossible. 

The ability of the eokaryal cfuyme to reccjgni^e and cleave 
(udepefrdeFUly ol the naature domain creates ihe possibiHty tor 
cleavage of non-tRNA sutistratesi iRgure 3J. the eiskaryai etido- 
nuck^ase can fe<;ognize afid cleave -substrates m the f"5iatufe- 
dornaln irdependent mode^, any RNA that contain$ a BtiB i^truc- 




ussf'ifn^n^i: wmfi iOX> hM) vt.5?;.«J i;j<;«hatod wUh. MeihmnfiWfCcttx 
end5.>uwck':m': (Kh far 90 mm <>a 25"^C); gifrjjjitiai v^^kks tvMm't (GV tor 

(AM iLr.uJ utis). The -^hoNv-is ivnty itu^ h't<ip\' tVa.^rjsstfit TJiuv.t<iU-icj 
Ci.\«ct>rtJ^:>tif^^^ a> 3tK)K tSiai iinti; j?iMiJtEift 1 duptex). C, duplex cofttaJftiftg ihe 



turc should be able lo serve as a sijbsi^'ate. buch a target could be 
^^nerated in mS^NA bv addtofi: a ^foi^abte RNA olmonucletkie. 

Figure .3 $how§ that the arcbaea^ and eukaryal en^yme$ cleave 
mouse projjJ^n I mRNA (Vvidada e? a/., j 989), when the RNA ss 
completed with cjnother oNgonbonucieotidf? forming ii 
Jhh cbavag^^ occurs \i\ a BHB-depender^^ sTianfjer because fully 
doubie-stranded molecules tFigtife 35 and n'jolecuk^s pff^scnting 
•m insertion m three base pairs m the helix of the BFtB are noi 
ck^aved (liata v>o\ shc^v^-n), Figi^re 3 shows tt?at cle;jvags? abo 
occurs in e><tract?i germk^al vesicles? (CiiV- extrjicts). Agasn^ 
cbava}>s^ is BHB-deperdefU, h^ovveve?v cleava^^? m ihh extract 
occurr<?<i m\y m the prf?s<?nc.e ot a 10CMt>ld excess of unreJated 
double strauded RNA (dsRNAy Pre^-tRNA^^=--=^'^-'-, on the contrary, 
is cleaved at high etficiCfKy uiata not shown}. An e?<plaoatjon t^^r 
these diherencofj is the presence in CV extfacts oi adenosine 
dean"iinases (ADARs) Ihat. coriveert i^deno^^snes to inoslncs vvsthin 
dsRNA (Bass and VVesntfaub, Jhereby c;5usmg the RNA 

duplex to tali ispiut, disrupting ihe B^^B ^tfuclute, Presumably^ at 
kw- contenfraticn of cbRNA, AllARs deamksate the sobs^trate 
and, as a resisli of the incre^^sed ssngle^ stranded character of the 
mofecu^e. I'be BrtB b de^J.fOyefi. 

Our results sfxiicate that thetormatfOu of a BHB b an oblig^ite 
st^jp in cleavage by theeukaryai er^dosiucfe<Jses and explatn why 
fhe euk;3iryai endonue!f?a$^ retain the .ability to operate m the 
mjjttire -domain tndejxmdent mode when Jheir n?jtural !?ubstrates 
do not h.'5ve<i BH^, 

METHODS 

Pr^va;RNAs were syn[l"5S:s^zed as described (MaStcjccsa ef a/.. 

Baldi et aL 1992; Fabbri el ab.l998). Templates tor She 
syrsihesjs of the piotdin I jtiRNA nrsolecuies vyere constrtjcted by 
PCR. The PCR templates were the fuJi -length molecules. One 
prlrner contah>ed the T7 promoter. Condjttons for PCR. sran- 
scriptiori by T7 RNtA polymerase, and endor^uclease assays were 
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as described (Mattoccia e?i IWQ; Bal<:ii e?<?C 1992; Fabbfi et 
j 998). Dupktx RNAs were? pa^p^jr^rd *ss d€$cfjb«?ci ijii<iss md 
Wei fslraub, 1988), 
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We have detected two paralogs of the tRNA endonudease gene of 
Methanocaldococcus jannaschii in the genome of the crenarchae- 
ote Sulfolobus solfataricus. This finding has led to the discovery of 
a previously unrecognized oligomeric form of the enzyme. The two 
genes code for two different subunits, both of which are required 
for cleavage of the pre-tRNA substrate. Thus, there are now three 
forms of tRNA endonudease in the Archaea: a homotetramer in 
some Euryarchaea, a homodimer in other Euryarchaea, and a 
heterotetramer in the Crenarchaea and the Nanoarchaea. The 
last-named enzyme, arising most likely by gene duplication and 
subsequent "subfunctionalization," requires the products of both 
genes to be active. 

molecular evolution | RNA-protein interactions | tRNA splicing 

Gene duplication is the primary source of new genes. The 
"subfunctionahzation" hypothesis argues that duphcate genes 
experience degenerate mutations that divide the activity encoded 
by a single ancestral gene among its descendants (1). Here, we 
report a striking example of subfunctionalization. 

In Archaea, the tRNA endonudease plays a key role in assuring 
the correct removal of the intron from pre-tRNAs and pre-rRNA 
(2-6), which constitute the core of the translation machinery. 
Crystal structures of the tRNA endonucleases from Methanocal- 
dococcus jannaschii (METJA) and Archaeoglohus fulgidus 
(ARCFU), both belonging to the phylum Euryarchaeota, are 
available (7, 8). These structures differ in a remarkable way. The 
structure of the homotetrameric endonudease from METJA re- 
veals two different functional roles for the monomeric units. The 
METJA endonudease is organized as a dimer of dimers, with one 
subunit from each dimer participating in the catalytic cleavage 
reaction (the catalytic subunit) and the other (structural) subunit 
acting to place the two catalytic subunits correctly in space. 

The crystal structure of the ARCFU endonudease, by contrast, 
shows it to be a homodimer. Each subunit contains two similar 
repeating domains that are homologous to the subunit structure of 
the homotetrameric enzyme from METJA; the C-terminal repeat 
(CR) is the active domain, and the N-terminal repeat (NR) acts to 
stabilize the dimer. 

The overall shape and size of the homodimeric ARCFU endo- 
nudease resembles that of the homotetrameric METJA enzyme. 

Both METJA and ARCFU belong to the Euryarchaeota. Noth- 
ing is known about the tRNA endonudease of Crenarchaeota, the 
other main family of Archaea. To determine the properties of a 
crenarchaeal endonudease, we searched the genome sequence of 
Sulfolobus solfataricus (SULSO) for homologs of the METJA 
endonudease and found two candidate sequences. 

Characterization of the two gene products reveals that both are 
required for tRNA endonudease activity, each presumably func- 
tioning like one-half of the ARCFU enzyme. Detailed analysis of 
the amino acid sequences of the two proteins supports the idea that 
they evolved by the process called subfunctionalization (1, 9, 10). 

www.pnas.org/cgi /doi/1 0. 1 073/pnas.05023501 02 



Materials and Methods 

Identification of the Homologs. The genes coding for the two 
SULSO proteins were PCR-amplified from genomic DNA by using 
two primers designed to obtain an amplified fragment presenting an 
Ndel site upstream of the gene and a BamHI site downstream. The 
digested PCR fragments were cloned in both pET28 (Novagen) and 
in pCYCA-llb (11). These two plasmids harbor two origins of 
replication that are compatible with one another and also two genes 
coding for different antibiotic resistances (kanamycin and chlor- 
amphenicol). The same procedure was used to clone the gene 
encoding the tRNA splicing endonudease from ARCFU genomic 
DNA. The two truncated forms of endonudease, NR_ARCFU and 
CR_ARCFU, were generated by PCR using the cloned full-length 
gene as template. NR_ARCFU comprises residues 1-138, and 
CR_ARCFU comprises residues 139-326. The PCR products were 
digested and cloned as previously described in pET28b and 
pCYAC-llb, respectively. All sequences of the clones obtained 
were verified. 

Expression and Purification. Both proteins were overexpressed alone 
as hexahistidine-tagged protein (pET28) or coexpressed together 
with the untagged form (pCYCA-llb) mEscherichia coli BL21DE3 
(Novagen). Cells were grown in a 1 -liter culture of Luria-Bertani 
broth at 37°C in the presence of 30 jtxg/ml kanamycin (pET28) and 
adding, in the case of coexpression overproductions, 30 /xg/ml 
chloramphenicol (pCYCA-llb). After cells reached an absorbance 
(at 600 nm) of 0.6, 1 mM IPTG was added to induce expression, and 
cells were grown for an additional 3 h. The bacterial pellets were 
resupended in 50 mM Tris-HCl, pH 8/500 mM NaCl/1 mM 
PMSF/0.1% Tween 20 and lysed by adding lysozyme to a final 
concentration of 100 jitg/ml. The extract was then heated at 65°C 
for 20 min and centrifuged for 1 h at 4°C. The supernatant was 
loaded on a 2-nil TALON cobalt affinity column (BD Biosciences) 
and washed with lysis buffer, and retained protein was eluted by 
adding 100 mM imidazole to the lysis biiffer. We recovered large 
quantities of homogeneous protein "suitable for crystallization 
trials." 

in Vitro Transcription. The DNA template was prepared and tran- 
scribed by T7 RNA polymerase (12-14) using the T7-Megashort- 
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script kit (Ambion, Austin, TX). Transcripts were internally labeled 
with p2p]UTP [800 Ci/mmol (1 Ci = 37 GBq), Amersham 
Pharmacia]. After phenol extraction and ethanol precipitation, 
transcripts were purified on a 10% denaturing polyacrylamide gel, 
eluted, and ethanol-precipitated. 

In Vitro Splicing Assay. Labeled tRNA precursors (20 fmol) were 
incubated with purified splicing endonucleases in reaction mixtures 
containing 25 mM Tris-HCl, pH 7.5/5 mM MgCyiOO mM NaCl/ 
10% glycerol at 65°C for 1 h. Cleavage products were analyzed after 
phenol extraction and ethanol precipitation by gel electrophoresis 
on 10% denaturing polyacrylamide gel. Image analysis was done by 
using a Molecular Dynamics Model Storm 860 Phosphorlmager 
with IMAGEQUANT 4 software. 

Sequence Analysis. The gene coding for tRNA endonuclease sub- 
units in METJA was used as the query sequence in blastp searches 
of the National Center for Biotechnology Information genomic 

database targeting the completed archaeal genomes including the 
species SULSO, Siilfolohiis tokodaii (SULTO), Aeropyrum pernix 
(AERPE), Pyrobacidum aerophUum (PYRAE), ARCFU, Haloar- 
cula marismortui (HALVO), HaJobacterium sp. (HALNl), Meth- 
anothermobacter thermaiitotrophicus (METTH), METJA, Meth- 
anopyrus kandleri (METKA), Methanosarcina acetivorans 
(METAC), Methanosarcina barken (METBA), Methanosarcina 
mazei {METyiPs),Pyrococcus abyssi (PYKAB), Pyrococais fiiriosus 
(PYRFU), Pyrococcus horikoshii (PYRHO), Ferropkisma acidar- 
manus (FERAC), Thermoplasma acidophihmi (THEAC), Jliermo- 
plasma volcanium (THEVO), and Nanonarcheum equitans 
(NANEQ). The proteins detected were included in a multiple 
alignment bixilt by using the program mafft (15). A refinement of 
the alignment was carried out by using the program rascal (16), 
and the quahty was verified by normd (17). 

Results 

Genes encoding the tRNA endonucleases were sought in 20 
completed archaeal genomes by using the METJA subunit se- 
quence as the probe in blastp searches. We found a single gene that 
codes for a polypeptide of *=«180 residues in the genomes of 
PYRAB, PYRFU, PYRHO, METJA, and METTH. Presumably, 
the enzymes from these organisms assemble as homotetramers 
(a4), like the METJA enzyme. 

The single identified genes coding for an endonuclease in the 
genomes of METAC, METBA, METMA, ARCFU, HALVO, 
HALNl, FERAC, THEAC, and THEVO code for a subunit of 
«*350 residues that presumably assembles into a homodimer, as 
does the ARCFU enzyme. 

The search of the genomes of the Crenarchaeota SULSO, 
SULTO, AERPE, NANEQ, PYRAE, and METKA revealed two 
homologs. In the genome of SULSO, one gene codes for a protein, 
Q97ZY3, with an expectation value {E value) of le — 22, clearly 
indicating an ortholog. The size of this protein, 182 residues, 
suggests a possible assembly into a homotetrameric (014) enzyme, as 
in the case of METJA. We named this protein a_SULSO. The 
second homolog, Q980L2, has an lvalue slightly above the twilight 
zone of 0.001 (0.004). The protein encoded by this gene contains 
177 residues. We reasoned that the second gene could code for a 
second subunit of the endonuclease. The METJA enzyme is 
a homotetramer, whereas the ARCFU enzyme is assembled as a 
homodimer with the monomeric unit resulting from duplication 
and fusion of a METJA-like monomer (18). We named the second 
SULSO protein jSJSULSO. The SULSO enzyme might thus as- 
semble as a heterotetramer: a2^2- 

Before describing biochemical experiments that explore the 
activities of the tRNA endonuclease subvinits, we present a bioin- 
formatic analysis of the amino acid sequences found and the 
structures predicted for these sequences. 



We first characterized all of the sequences at the level of their 
primary structure (Fig. 1). The homodimer subunits were divided 
into two monomers of roughly the same size as the tetrameric 
monomer on the basis of the information obtained from the 
crystallographic structure of the ARCFU endonuclease. The NR 
(NR_ARCFU) is composed of residues 1-138, and CR 
(CR_ARCFU) is composed of residues 139-326. All of the protein 
sequences were then aligned as described iw Materials and Methods. 

The sequences were divided into two groups in relation to the 
role played in the enzymatic complex according to the ARCFU 
structure. One set included the structural subunits: the putative 
j8-subunits and the NRs. The second set included the catalytic 
subunits: the a-subunit and the CRs. The homotetrameric subunits 
were included in both groups, because they can act as catalytic or 
structural subunits, depending on how they assemble. For clarity, 
the numbering of the residues used throughout refers to the 
METJA sequence. The multiple alignments of the structural and 
catalytic subunits are shown in Fig. 1. The similarities and identities 
(red and blue letters in Fig. 1) were separately determined for each 
group. Subsequently, homology regions were highlighted by taking 
into consideration the similarities and identities of the two distinct 
groups at the same time. 

When the two sets are compared, three groups of residues can be 
distinguished: residues conserved between the two classes (yellow 
boxes), residues conserved only in the catalytic class (green boxes), 
and residues conserved in the structural class (cyan boxes). The 
majority of residues conserved in conmion between the two classes 
of subunits (yellow boxes) are clustered in two distinct segments of 
15 aa: one segment includes strand j84-helix a2-strand j85 and the 
second includes helix Q;4-loop-strand j86. The first pattern presents 
a motif [L-x-L-x-x-(L,I,F,V)-x-Y-L-x-x-(K,R)-G-x-(L,I)] that is lo- 
calized in the N- terminal domain, and the other presents a motif 
[(K,R)-(Y,F)-Xn-V-Y-x-D-L-(K,R)-x-(K,R)-G-(Y,F)-x-V| that is lo- 
calized in the C-terminal domain. The first motif is not conserved 
in all of the structural subunits of crenoarcheote enzymes. In 
particular, in jS-SULSO, this motif is highly divergent, and it has 
been deleted, along with the entire N-terminal domain, in 
jS-SULTO and j8-PYRAE. The second motif is universally 
conserved. 

In the catalytic class (green boxes), the specific signature residues 
are clustered in loop L6, loop L7, and loop L9. The pattern of loop 
L6 is of six residues with a motif [(K,R)-(T,S)-(F,L,V,I)-K-(Y,F)- 
G]. Two of the three catalytic residues Y^^^ and H^^^ (residues in 
italics) are present in the conserved pattern of loop L7 [Y-Xn-//- 
(A,S)-(D,E)-(Y,F,W)-(I,L,V)], ahhough the size of this loop is 
variable. L9 presents an eight-residue motif [R-(V,L)-(A,S)-H- 
(G,S)-(V,T)-R-i^-(K,R)-(L,M)], which includes the catalytic resi- 
due K^^^ (in italics). 

The structural subunit class (cyan boxes) is characterized by a 
[V-D-(E,D)-(E,D)-x-(D,E)-x-T] conserved pattern in loop LIO, 
and a conserved E at position 135. 

The mutually exclusive sequence motifs of the two classes can be 
represented on a surface calculated from the structures of the 
monomer (Fig. 24) and of the homotetramer of the METJA 
endonuclease (Fig. 2B). Extension of the information available 
from the crystallographic studies to the endonuclease alignments 
strongly supports the existence of a canonical structure. This 
structure is formed by two different domains (Fig. 1). The N- 
terminal domain is characterized by a five-strand antiparallel 
]3-sheet (strands ]81, j82, j83, j34, and /35) packed against helix dl and 
a3, both of which lie perpendicular to al. The C-terminal domain 
is characterized by a five -strand j3-sheet (strands /36, ]37, ]38, ]39, and 
jSlO) sandwiched between helix aA and a5. 

The group of catalytic subunits shares a similar secondary 
structure (Fig. 1). The crenarchaeote proteins, in particular, contain 
an insertion in the loop between j83 and j84 rich in hydrophilic 
amino acids. CR_THEVO and CR_THEAC have lost helices a3 
and a4. The size of loop L7 varies among all of the members of this 
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Fig. 1. Primary sequences and secondary structures of tRNA endonuclease subunits. A sequence alignment of tRNA endonuclease subunits is presented as two 
separate groups as described in the text. Sequences belonging to the structural subunit class are colored cyan, and sequences belonging to the catalytic subunit are 
colored green. Conserved residues are in red, and similar residues are in blue. Cyan columns indicate positions that are conserved among the catalytic subunits. Green 
columns highlight positions that are conserved among the structural subunits. Yellow columns indicate residues that are conserved in both groups. Secondary structure 
elements, as determined from the crystallographic structure of METJA, are indicated on top of the sequence of NRJ\RCFU. Red arrows indicate /3-strands, and green 
cylinders represent a-helices. The plus signs at the bottom of the alignment indicate the conserved catalytic triad Tyr-1 15/His-125/Lys-156. 



group, but the two amino acids of the catalytic triads are always 
conserved. Variation in size and divergence in sequence are also 
present in loop LIO. 



The group of structural subunits shows, in the N-terminal 
domain, strong sequence divergence. In the case of j8_SULTO and 
]3_PYRAE, the entire N-terminal domain has been deleted. 
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Fig. 2. Structural analysis. {A) Results of the multiple sequence alignment 
mapped onto the three-dimensional structure of a METJA monomer (1 A79). 
Residues conserved in the structural subunits are colored cyan; residues con- 
served in the catalyticsubunit are in green. (B) Sequence conservation mapped 
onto the structure of the tetramer of METJA (1 A79). 



NR_HALN1 and NR_HALVO show a larger loop between j36 and 
j87. The sequence and size of loop L7 is extremely varied in all of 
the class members. NR_ARCFU, in particular, shows a major 
deletion at this location. NR_THEAC, NR_THEVO, NR_FERAC, 
NR_H ALVO, and NR_H ALNl are deleted in the loop between a5 
and j89. It is evident from the alignment that loop LIO and the 
residues in the j810 strand, known to play a structural role in the 
dimerization of the ARCFU enzyme and the tetramerization of 
the METJA enzyme (7, 8), are conserved. 

It is evident from this analysis and the published work of 
others that the known tRNA endonucleases of Archaea can take 
one of two forms: (?) a homotetramer in which two of the 
subunits play a structural role and two contain the catalytic sites 
or (if) a homodimer, each of whose subunits, twice the length of 
the METJA class, contains a structural part and a catalytic part 
(7, 8, 18, 19). The bioinformatic analysis suggests that the latter 
enzyme arose from a gene duplication/fusion event, followed by 
a few mutations that allow for subfunctionalization of the two 
regions of the protein. To illustrate this possibility, we reversed 
the putative evolutionary path by cutting the gene encoding the 
ARCFU tRNA endonuclease into two segments, expressed each 
polypeptide independently, and assayed their ability to function 
in vitro. 

For these experiments, we used the substrate tRNA archaeuka 
(Fig. 3/4), a synthetic molecule containing an intron recognized by 
either archaeal or eukaryotic endonucleases (12, 20). Two trun- 
cated forms of the ARCFU enzyme, NR_ARCFU and 
CR_ARCFU, were generated. NR_ARCFU was cloned in pET28 
to express a protein with an N-terminal His tag (His-6 



NR_ARCFU). CR_ARCFU was cloned in pCYAC-llb. His-6 
NR_ARCFU was overexpressed alone or in the presence of 
CR_ARCFU. After lysis, the crude extracts were pulled down by 
using a cobalt TALON affinity resin and extensively washed to 
remove nonspecifically bound proteins. The bound proteins were 
eluted from the resin with imidazole and assayed. Fig. 3B (lanes 4 
and 5) shows that the excision of the intron from pre -tRNA 
archaeuka occurs only when the two truncated proteins are coex- 
pressed. The purified proteins were visualized by Coomassie blue 
staining on SDS polyacrylamide gels (data not shown). These 
results are consistent with the conclusion that NR_ARCFU is the 
structural subunit, acting to place CR_ARCFU, the catalytic sub- 
unit, in space correctly. 

Having shown that the two halves of a homodimer endonuclease 
could be assembled into an active enzyme, we investigated the 
endonuclease predicted from the genome sequence of the crenar- 
chaeote SULSO. Recall that the genome analysis reveals one ORF 
(a) with very high similarity to the homotetramer of METJA and 
another with significantly lower similarity (j8). 

The gene for a_SULSO and the gene coding for the METJA 
monomer were cloned in pET28. The constructs code for proteins 
with a modified N terminus presenting a His-6 tag. The tagged 
enzymes were purified by affinity chromatography as described in 
Materials and Methods, The formation of aggregates was prevented 
by using high salt. Purification resulted, in both cases, in homoge- 
neous proteins, as judged from Coomassie blue staining on SDS/ 
PAGE. To determine whether the a_SULSO enzyme has a dual 
functional role like the METJA monomer, we tested the purified 
a_SULSO protein for cleavage of pre-tRNA archaeuka. We already 
knew that the intron of this substrate could be precisely excised by 
using a partially pixrified extract from SULSO (12). Fig. 3C (lane 6) 
shows that the substrate is correctly cleaved by the purified endo- 
nuclease from METJA but not by the protein o-SULSO (lanel). 
Leaving other possibilities aside for the moment, we conclude that 
a_SULSO cannot tetramerize like METJA. This observation led us 
to consider the possibility that ]8_SULSO allows the a-subunit to 
assemble properly. 

The pull-down approach was then used to investigate the oli- 
gomerization of the two SULSO gene products. Two different 
constructs were produced for each gene to obtain an N-terminal- 
tagged version and an untagged version of the products. The tagged 
versions were overexpressed alone or in the presence of the 
untagged version of the other gene. After purification, the proteins 
were assayed by using pre-tRNA archaeuka as substrate. Fig. 3C 
(lanes 1 and 3) shows that neither j8_SULSO nor o-SULSO alone 
cleaves the substrate; the activity is detectable only when the two 
proteins are coexpressed (Fig. 3C, lanes 2 and 4). 

Thus, SULSO contains two genes, each encoding one subunit of 
a (presumably) heterotetrameric endonuclease, mimicking the spe- 
cialization revealed in the assembly of the artificially created 
subunits of the ARCFU enzyme. 

Discussion 

A Previously Unrecognized Form of Endonuclease in SULSO. We 

present here the finding of a third archaeal tRNA endonuclease 
complex in the crenoarchaeote SULSO that is the result of assembly 
of two different subunits homologous to METJA, each one pre- 
sumably playing one of the two different roles in a heterotetramer 
(Fig. 4). To identify the SULSO enzyme, we searched the se- 
quenced genome by using METJA as a probe and found two 
homologs of the gene. The proteins encoded by these genes were 
expressed alone or together and, after purification to homogeneity, 
were tested for specific cleavage activity. 

The enzyme is functionally active only if both subunits are 
present. Analysis of all of the completed genomes of 20 Archaea 
helped to extend the existence of a second homolog of the METJA 
gene to all Creanoarcheota and the Nanoarcheota and to deter- 
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Fig. 3. tRNA endonuclease activity assays. (>A) Pre-tRNA^''*^'^^^"'^^ consists of two regions derived from yeast pre-tRNAP"^^ (nucleotides 1-3 1 and 38-76) joined by 
a 25-nt insert that corresponds to the bulge- helix- bulge motif of archaeal tRNA"'^''P. (B) Pre-tRNA archaeuka was incubated with four different recombinant 
proteins. The cleavage products were analyzed by electrophoresis on 10% polyacrylamide (29:1)/8 M urea gel, followed by autoradiography. The identification 
of the reaction products is indicated. Lane 1, control (no enzyme added); lane 2, endonuclease heterotetramer of SULSO; lane 3, endonuclease homodimer of 
ARCFU; lane 4, His-6-tagged NR_ARCFU domain alone; lane 5, NR_ARCFU coexpressed with CR_ARCFU domain. (C) Pre-tRNA archaeuka was incubated with five 
different recombinant purified proteins. Cleavage products were analyzed as in B. The identification of the reaction products is indicated. Lane 1, a_SULSO 
subunit; lane 2, clSULSO coexpressed with the j3_SULSO subunit; lane 3, j3_SULSO subunit; lane 4, j3-SULSO subunit coexpressed with the a_SULSO subunit; lane 
5, control (no enzyme added); lane 6, METJA endonuclease. * indicates His-6 tagging. 



mine the phylogenetic distribution of the different endonuclease 

architectures. 

Evolution of tRNA Endonucleases in Archaea. The distribution of 
subunits in the different archaeal phyla and comparison of the 
ahgned proteins provide a imique picture of how these enzymes 
evolved (Fig. 4). The ancestor of the archaeal enzyme was probably 
a homotetramer that, after two gene duplication events (or hori- 
zontal gene transfer), gave rise to a homodimeric form and a 




Fig. 4. Schematic representation of the phylogenetic tree of Archaea 
(adapted from ref. 21). Each rectangle surrounds species sharing the same 
endonuclease architecture. The subunits are named according to Fig. 5. The 
duplication events are indicated by an arrow. 



heterotetrameric form. The duplication events were independent 
from one another. One event took place in the common ancestor 
of Crenoarcheota and Nanoarcheota and resixlted in two genes 




Fig. 5. ModelofthetRNAsplicingendonucleasesof METJA, ARCFU,andSULSO. 
(A) model of the METJA homotetramer. Several important structural features 
discussed in the text are indicated: loop L10 (cyan triangle), the C-terminal /310 
strand (arrow), and the consen/ed catalytic triad residues Tyr-115, His-125, and 
Lys-156 (green pentagon). (B) Proposed subunit arrangement of the ARCFU 
endonuclease. The NR lacks two of the three putative active-site residues. It does, 
however, contain many of the features of the C-terminal domain, which are 
important for structural arrangements of the enzyme: in particular, the L10 
sequence. The C-terminal repeat contains all of the sequence features of the 
METJA enzyme. Black lines represent the polypeptide chain connecting the CRs 
and NRs. (O A proposed model forthe SULSO endonuclease. The a-subunit of the 
SULSO enzyme contains the conserved catalytic triad (green pentagon) and the 
C-terminal jSIOstrand (arrow). The |3-subunit of the SULSO enzyme contains loop 
L10 (cyan triangle) and the C-terminal j310 strand (arrow). 
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coding for two subunits, whereas the other occurred in the ancestor 
of Ai'chaeoglobales, Halobacteriales, and Methanosarcinales, re- 
sulting in an in-frame duplication and giving rise to a single gene 
coding for two fused subunits. 

Gene Duplication in Archaeal tRNA Endonucleases: An Example of 
Subf unctionalization. Two pairs of subunits (Al and A2, and Bl and 

B2) associate to form isologous ditners by means of extensive 
interactions between their j810 strands (Fig. 5A, arrow). The 
carboxyl half of ]310 from one subunit forms main-chain hydrogen 
bonds with the symmetry-related residues of /310 from the other 
subvmit (jSlO'), leading to a two-stranded /3-sheet spanning the 
subunit boundary. The tetramer is formed by means of heterolo- 
gous interactions between the two dimers. The main interaction 
between the two dimers is by means of the insertion of loop LIO 
(triangle in Fig. 5) from subunits A2 and B2 iato a cleft in subunits 
Bl and Al between the N- and C-terminal domaias of each 
monomer. The interaction is primarily polar between acidic resi- 
dues in loop LIO and basic residues in the cleft. This arrangement 
causes the two isologous dimers to be translated relative to each 
other by ^20 A, bringing subunits Al and Bl much closer together 
than A2 and B2, which do not interact at all, and resultiag in an 
arrangement of subunits in which only one symmetrically disposed 
pair of active sites can recognize the substrate. These weak iater- 
actions are likely to have been conserved ia evolution because they 
lead to the reqviired positioning of the two active sites. 

The intersubunit /310-J310' interaction observed in the METJA 
endonuclease is now an intrasubunit interaction in the ARCFU 
endonuclease dimer. ARCFU endonuclease also uses a similar LIO 
interaction, but to form a dimer (Fig. SB). The three residues at the 
previously identified active site in the METJA endonuclease (His- 
125, Lys-156, and Tyr-115; pentagon in Fig. 5) are superimposable 
with those in. the ARCFU endonuclease (His-257, Lys-287, and 
Tyr-246). This result is consistent with the fact that both enzymes 
act on the same bulge -helix-bulge motif in their pre-tRNA sub- 
strates. The connection between the NRs and the CRs is a fully 
extended 10-aa residue segment that wraps around the C-terminal 
domain of the CR before joining the N- terminal domain (Fig. 5B). 
The two CRs, CI and C2, occupy the same spatial location as 
subunits Al and Bl in the METJA endonuclease (in Fig. SB). The 
two NRs, Nl and N2, occupy the same spatial location as the A2 
and B2 subimits in the METJA endonuclease and thus play the 
same structural roles as the A2 and B2 subimits. Significantly, the 
NR has retained the LIO structure-mediating dimerization of 
the ARCFU endonuclease, equivalent to its role in tetramerization 
of the METJA endonuclease (Fig. SB). The LIO structure is absent 
from the CR, suggesting that LIO has a role only in dimerization. 

We generated two truncated forms of the ARCFU enzyme, 
NR_ARCFU and CR_ARCFU. The two halves of the homodimer 
endonucleases can be assembled into an active heterotetramer; 
NR_ARCFU is the catalytic subunit and contains the conserved 
catalj^ic triad (Fig. 5B). SULSO contains two genes, each coding 
for one of the subimits of a heterotetrameric endonuclease, mim- 
icking the specialization revealed in the assembly of the artificially 
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created subunits of the ARCFU enzyme. The a-subunit of the of 
the SULSO enzyme contains the catalytic triad, and the j3-simbunit 
contains the loop LIO. 

The general model of the process of duplication- degeneration- 
complementation proposes that degenerative mutations facilitate 
the preservation of the duplicated functional gene (9, 10). The gene, 
once duplicated, can follow three possible alternative evolutionary 
fates. After mutations in the regulatory regions or a null mutation 
in the coding region, the activity of one copy can be lost (nonfunc- 
tionalization). One copy can acquire mutations that result in the 
gain of a new function (neofunctionalization). Fiaally, both copies 
can accumulate mutations that lead to specialization (e.g., catalytic 
or structural), such that each copy is necessary to function as the 
single-copy ancestor gene (subfunctionalization). 

Sequence analysis based on the available structural data has 
revealed two sets of mutually exclusive and substantially conserved 
motifs, allowing assignment of the endonuclease subunits to two 
classes according to their functional role. The homotetramer pre- 
sents both sets of motifs, which correspond to a subimit capable of 
performing either function, according to how it is assembled in the 
enzyme. The homodimer and heterotetramer present domains or 
subunits subspecialized in one function or the other, presenting, 
therefore, one set of motifs or the other. The biochemical data we 
have presented here clearly support qualitative subfunctionalization 
in the case of the homodiiners and heterotetramers. Both domains 
or subimits are required for the correct assembly of the enzyme to 
process the pre-tRNA precursor. This subfunctionalization is even 
more evident in the case of an ARCFU heterotetramer reconsti- 
tuted by expressing the homodimer gene as two truncated forms. 
The resulting heterotetramer is active. 

Conclusions 

Our results suggest that on an evolutionary scale, the common 
ancestor of all of the archaeal enzymes was a homotetramer with 
four identical subimits that could play both the catalytic role and the 
structural role interchangeably, as in the contemporary methano- 
gens. The appearance of a second copy of the gene resulted in 
subfunctionalization of the two copies, resulting in the splitting of 
the two roles, as in the case of homodimers and heterotetramers. 
The accumulation of mutations compromised one function or the 
other, as is evident from the conservation of signature motife. 
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Members of the three kingdoms of life contain tRNA genes with 
introns. The introns in pre-tRNAs of Bacteria are self-splicing, 
whereas introns in archaea! and eukaryal pre-tRNAs are removed 
by splicing endonucleases. We have studied the structures of the 
endonucleases of Archaea and the architecture of the sites recog- 
nized in their pre-tRNA substrates. Three endonuclease structures 
are known in the Archaea: a homotetramer in some Euryarchaea, 
a homodimer in other Euryarchaea, and a heterotetramer in the 
Crenarchaeota. The homotetramer cleaves only the canonical 
bulge- helix- bulge structure in its substrates. Variants of the 
substrate structure, termed bulge-helix-loops, appear in the pre- 
tRNAs of the Crenarcheota and Nanoarcheota. These variant struc- 
tures can be cleaved only by the homodimer or heterotetramer 
forms of the endonucleases. Thus, the structures of the endonucle- 
ases and their substrates appear to have evolved together. 

molecular evolution | RNA-protein interactions | splicing 

In Archaea, the tRNA splicing endonuclease is responsible for 
the correct removal of introns from pre-tRNAs and is also 
involved in the processing of pre-rRNA and presumably certain 
pre-mRNA (1-4). An RNA motif consisting of a bulge-helix- 
bulge (BHB) is the universal substrate of the endonucleases from 
all archaeal lineages and eukaryotes (5). This motif has been 
shown by biochemical and NMR studies to be comprised of two 
bulges of three nucleotides symmetrically disposed on opposite 
strands and separated by a helix of four base pairs (6, 7). 
Although a consensus sequence has been derived (8), the 
conformation of this structure appears to be more relevant than 
its sequence (9). 

The development of the genomics of Archaea made possible 
a characterization of the genes coding for pre-tRNA substrates 

(10) and the genes coding for the tRNA splicing endonucleases 

(11) . Most introns of archaeal pre-tRNA genes are located in the 
anticodon loop, between nucleotides 37 and 38, the unique 
location of their eukaryotic counterparts. However, in several 
Archaea, mostly in Crenarchaeota, introns have been found at 
other positions: the anticodon stem and loop, the D- and 
T-loops, the V-arm, or the amino acid arm. Marck and Grosjean 
(10) renamed the BHB as hBHBh', indicating with the new name 
that the canonical BHB motif should be enlarged to include two 
outer helices having at least two Watson-Crick base pairs. For 
introns located at 37/38 and elsewhere in the pre-tRNA, ca- 
nonical hBHBh' motifs were not always found. Instead, a relaxed 
hBH or HBh' motif, including the constant central 4-bp helix H 
flanked by one helix (h or h') with at least two Watson-Crick 
base pairs on either side, could be discerned (10). 

We recently detected two paralogs of the tRNA endonuclease 
gene of Methanocaldococcus jannaschii (METJA) in the genome 
of the crenarchaeote Sulfolobus solfataricus (SULSO) (11). This 
finding led to the discovery of a previously unrecognized het- 
erotetrameric form of the enzyme. The two genes code for two 
different subunits, both of which are required for cleavage of the 
pre-tRNA substrate. Thus, three different forms of tRNA 
endonuclease can now be recognized in the Archaea: a homotet- 
ramer in some Euryarchaea (such as METJA), a homodimer in 
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Other Euryarchaea (such ^isArcheoglobiisfiilgidiis, ARCFU), and 
a heterotetramer in the Crenarchaeota (such as SULSO) and 
Nanoarchaeota. The heterotetrameric form of the enzyme, 
arising most likely by gene duplication and subsequent "sub- 
functionalization," reqtiires the products of both genes to be 
active (12, 13). 

Marck and Grosjean (10) were correct to recognize the several 
forms of the substrates but, missing the second subunit of the 
endonuclease from the Crenarcheota, they incorrectly assigned 
particular forms of the substrate to particular enzyme structures. 

In the present article, we analyze the relationship of the 
intron-containing motif of the pre-tRNAs to the tRNA endo- 
nuclease architecture in the Archaea and show that the relaxed 
form of the substrate requires either the dimeric or the het- 
erotetrameric endonuclease to be cleaved properly. 

Materials and Methods 

Expression and Purification of the Protein Constructs. The genes 
encoding the endonucleases from ARCFU, SULSO, and 
METJA were PCR-amplified from the respective genomic DNA 
using two primers designed to obtain an amplified fragment 
presenting an Ndel site upstream of the gene and a BamHI site 
downstream. The digested PCR fragments of the genes coding 
for ARCFU, METJA, and the a-subunit of SULSO were cloned 
in pET28b (Novagen), and the gene coding for the )3-subunit of 
SULSO was cloned in pCYCAllb (14). Ah of the clones 
obtained were verified by sequencing. The proteins were over- 
expressed as hexa-histidine-tagged forms (pET28b) with the 
exception of j8-SULSO (pCYAC-llb) that was untagged to be 
coexpressed together with the tagged form of a-SULSO 
(pET28b) in Escherichia coli BL2IDE3 (Novagen). Cells were 
grown in 1 -liter cultures of Luria-Bertani broth at 37°C in the 
presence of 30 /xg/ml kanamycin (pET28) with the addition of 
30 /xg/ml chloramphenicol (pCYCA-llb) in the case of coex- 
pression. The proteins were purified on a metal affinity column 
as described (11). Homogeneity of the enzymes was assessed by 
Coomassie blue staining of SDS polyacrylamide gels. The tRNA 
endonuclease from the toad Xenopus laevis (XENIA.) was 
purified according to ref. 15. 

In Vitro Transcription and Splicing. DNA templates prepared as 
described (11) were transcribed by T7 RNA polymerase by using 
the Ambion (Austin, TX) T7-Megashortscript kit in the presence 
of [a-32p]UTP (800 Ci/mmol; Amersham Pharmacia). Products 
of the correct size were purified on a 10% denaturing polyacryl- 
amide gel after phenol extraction and ethanol precipitation. 
Labeled tRNA precursors (20 fmol) were incubated with puri- 
fied splicing endonucleases in reaction mixtures containing 25 
mM Tris-HCl (pH 7.5), 5 mM MgCls, 100 mM NaCl, and 10% 
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Fig. 1. tRNA intron motifs and enzyme architectures. Vertical bars indicate 
tine species sharing the same endonuclease architecture (1 1). 04 refers to the 
homotetramer, refers to the homodimer, and a2li2 refers to the heterotet- 
ramer (1 1). The numbers indicate the number of hBHBh (BHB) and HBh or hBH 
(BHL) motifs present in the genome, according to ref. 10. 



glycerol at 65°C for 1 h with the exception of the reaction 
containing the XENLA enzyme that was incubated at 22°C. 

Cleavage products were analyzed, after phenol extraction and 
ethanol precipitation, by electrophoresis on 10% denaturing 
polyacryl amide gels. Image analysis was done by using a Molec- 
ular Dynamics model Storm 860 Phosphorlmager with image 
QUANT software, version 4. 

Results 

A Common Fold for All of the Archaeal Enzyme Subunits Is Stabilized 
by Two Conserved Residue Signatures. The genes coding for tRNA 
splicing endonucleases encoded in 19 different archaeal ge- 
nomes have been described and characterized. Three different 
forms of the endonuclease have been distinguished; the ancestor 
of the archaeal enzyme was probably a homotetramer, which, 
after two independent gene duplication events (or horizontal 
gene transfer), gave rise to a homodimeric and a heterotet- 
rameric form. One event took place in the ancestor of Crenar- 
cheota, resulting in two genes coding for two subunits, whereas 
the other occurred in the common ancestor of Archaeoglobales, 
Halobacteriales, and Methanosarcinales, resulting in an in-frame 
duplication, giving rise to a single gene coding for two fused 
subunits (Fig. 1). We have shown how the homodimeric subunits 
can work as a heterotetramer, by cutting the gene into two 
independent segments each expressing a polypeptide (11). In all 
of the natural tetramers and the artificially generated one, each 



set of two subunits plays a specific role. One set contains the 
catalytic sites, and the other has the structural role of positioning 
the subunits with the active sites. These two different roles have 
resulted in the acquisition of mutually exclusive features that 
allow one to distinguish two functional classes of subunits (11). 

Despite the existence of the two classes of subunits, a modular 
organization is conserved among them. The conserved residues 
in (/34)-(q:2)-(j85) in the N-terminal domain and in (Q:4)-loop- 
(]86) in the C-terminal domain delineate motifs that provide a 
specific signature for the endonuclease family (Fig. 2). Using this 
vmiversally conserved motif we can retrieve the subunits of 
Archaeal endonucleases selectively in the SwissProt database. 
The signature residues, represented in a cartoon model (Fig. 2A) 
of the METJA structure (16), stress the functional importance 
of helix a2 (blue) and helix a4 (purple) for the stabilization of 
each of the two domains and their positioning with respect to one 
another. When the residues are plotted every 100° consecutively 
around a spiral, the conserved residues in a2 are clustered on two 
opposite sides of the helix (Fig. 2B), presenting hydrophobic side 
chains. One face (residues 47, 50, 53, and 54) interacts with 
conserved residues in the ]8-sheet of the N-terminal domain. The 
other face (residues 48, 49, 52, and 56) interacts with the 
C-terminal domain and also directly with helix a4. 

Helix a4 also presents conserved residues on two different 
faces (Fig. 2C). One face (residues 88, 91, 94, and 95) is packed 
against the /3-sheet of the C-terminal domain, where it forms 
specific interactions with conserved residues. The other face 
(residues 85, 89, 92, and 96) interacts with helix a2 and the 
N-terminal domain. These observations support the existence of 
a canonical structure shared by all of the subunits, which implies 
that they all share a common origin (17, 18). 

Canonical and Noncanonical Motifs in Intron-Containing Archaeal 
Pre-tRNAs. Following Marck and Grosjean (10), we examined the 
sequences spanning intron-exon junctions in intron-containing 
pre-tRNAs of 19 Archaea. Particularly interesting are those 
introns whose length is too short to form a second 3-nt bulge 
followed by a helix consisting of at least two Watson-Crick pairs. 
Fig. 3 shows that both hBH and HBh' motifs are characterized 
by a bulge and an internal loop and can be represented by a 
structure that resembles the bulge-helix-loop (BHL), as de- 
scribed in some eukaryotic pre-tRNAs (19). Because, presiun- 
ably, the archaeal endonucleases do not contact the mature 
domain, hBH and HBh' do not appear as different to their 
enzymes. Hereafter, we shall refer to both hBH and HBh' motifs 
as BHL-like motifs. Of 139 intron-containing pre-tRNAs, 82 
contain a BHB and 57 contain a BHL-like motif. Fig. 1 shows 
that genes coding for intron-containing pre-tRNAs character- 
ized by the BHL-like motif are absent from species that carry a 
homotetrameric (0:4) tRNA endonuclease. Pre-tRNAs contain- 
ing BHL-like motifs are found only in those species characterized 
by the heterodimeric (0L2) or the heterotetrameric (a2p2) forms 
of the tRNA endonuclease. 

In Vitro Cleavage of Pre-tRNAs Presenting Either a BHB or a BHL Motif. 

These observations lead to the prediction that 0:4 endonucleases 
require a BHB substrate, whereas 0:2 or a2iS2 endonucleases can 
cleave BHL substrates. These predictions were tested as follows: 
two different uniformly labeled pre-tRNA substrates were used 
for the cleavage assay (Figs. 4 and 5). These comprise a 
pre-tRNA presenting a motif with the intron and the boundary 
region of the 5' exon and 3' exon of the molecule folded into 
either one of a 2- or 3-nt bulge separated by a 4-bp helix (BHB 
motif) (Fig. 4A), and a pre-tRNA presenting a 3-nt bulge and an 
internal loop separated by a 4-bp helix (BHL-like) (Fig. 5^). We 
used as a control the partially purified endonuclease from 
XENLA, because it can process both substrates correctly, based 
on previous observations (19). Each substrate was incubated 
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Fig. 2. Archaeal endonucleases common fold. (A) Cartoon representation of tlie monomeric subunit of IVIETJA (16). All of the side chains represented are 
conserved among all of the subunits of archaeal endonucleases. The residues constituting the two signatures, as specified in the text, are colored blue and purple; 
the other conserved residues are yellow. (B) Helical wheel representation of helix a2. Hydrophobic residues are shaded in gray; only the conserved amino acids 
are numbered. (O Helical wheel representation of helix a4. Representation as above. 



with three different tRNA endonucleases from Archaea, repre- 
senting each one of the three different architectures: the ho- 
modimer of ARCFU, the homotetramer of METJA, and the 
heterotetramer of SULSO. 

Fig. 4B shows the expected resvilt that all of the enzymes cleave 
the BHB substrate correctly. On the contrary, there are differ- 
ences in the processing of the BHL-hke intron. Fig. 5B, lane 2 
clearly shows that the homotetrameric enzyme from METJA 
cannot cleave the BHL-like structure correctly. The enzyme 
cleaves inefficiently, only at the 3' site, resulting in accumulation 
of the 5' half -intron product, whereas the heterotetrameric 
enzyme from SULSO and the homodimeric enzyme from 
ARCFU cleave the BHL-like structure correctly (Fig. SB, lanes 
3 and 4), as does the (eukaryotic) XENLA endonuclease (Fig. 
5B, lane 5). 
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Fig. 3. Canonical and noncanonical motifs. (A) hBHBh', BHB splicing motif 
flanked by two helices (h, h') presenting at least two Watson-Crick base pairs. 
Arrows indicate the 5' 32 -> 3' sense, (fi and O hBH and HBh' relaxed BHB 
motifs. The two superimposed motifs are shaded in gray to show the simi larity 
to a BHL motif. 



Cis and Trans Splicing Motifs. All of the substrates described above 
were characterized by BHB or BHL targets produced by an- 
nealing strands belonging to the same RNA molecule. We turn 
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Fig. 4. In vitro cleavage of a BHB-containing substrate. (A) The pre-tRNA 
Archeuka was constructed by using two regions derived from yeast pre-tRNA''^'® 
(nucleotides 1-31 and 38-76) joined by a 25-nt insert that corresponds to the BHB 
motif of archaeal tRNATrp. (B) The pre-tRNA Archeuka was incubated with four 
different enzymes. The conditions of the reactions are reported in Materials and 
Methods. The cleavage products were analyzed by electrophoresis on 10% 
polyacrylamide gel containing 29:1 monomer to bis and 8 M Urea, followed by 
autoradiography. The identification of the reaction products is indicated. Lane 1 
is control (Q no enzyme added); lanes 2-5 show the products after incubation 
with the endonucleases from METJA, SULSO, ARCFU, and XENLA, respectively. 
The 2/3 molecules are produced by single cleavage. The slowly migrating class 
corresponds to intron-3' exon molecules and the fast migrating class to 5' 
exon-intron molecules. The heterogeneity of the 3' halves results from the 
run-off transcription by T7 RNA polymerase, used to prepare the substrate. 
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Fig. 5. In vitro cleavage of a BHL-containing substrate. {A) The pre-tRNA''^y'^ 
from Caenorhabditis elegans. The synthetic substrate presents a residue 
change at the 5' terminus (C to G) required for 17 transcription and a 
corresponding change (G to Q in the complementary strand. (B) The pre- 
tRNA'^y was incubated with four different enzymes. The conditions of the 
reactions and product analysis were the same as described in the legend to 
Fig. 3. 
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Fig. 6. In vitro cleavage of a substrate presenting a trans BH L. {A) Proposed 
structure of the N. equitans pre-tRNA^'" formed by annealing the products of 
two half-genes. The left side of the structure contains the 5' half-gene, 
including the 5' cleavage site in the loop formed by CGAC. The right side of the 
structure contains the 3' cleavage site comprising the AAU bulge. The long 
GC-rich stem at the bottom of the structure is not covalently closed but that 
does not matter, because it is removed by the splicing endonuclease. (B) 
Cleavage of the pre-tRNA^'" by purified archaeal endonucleases. The sub- 
strate was incubated with three different recombinant proteins. The cleavage 
products were analyzed as described in the legend to Fig. 3 . The identification 
of the reaction products is indicated. Lane 1 is control (C, no enzyme added); 
lanes 2-4 show the products after incubation with the endonucleases from 
METJA, ARCFU, and SULSO, respectively. Because the substrate molecule is 
open at the bottom, the enzymes produce, in add it ion to 5' and 3' halves, two 
intron fragments. IVS (3'h) is the fragment resulting from cleavage at the 3' 
cleavage site, and IVS (5'h) is the fragment resulting from cleavage at the 
5' cleavage site. 



now to describe a substrate characterized by a BHL-like intron- 
exon junction produced by the assembly of two different RNA 
molecules. Nanoarchaeum equitans is a small hyperthermophilic 
archaeal parasite (20). The A^. equitans genome contains nine 
genes that encode tRNA halves; together they account for the 
missing genes encoding the glutamate, histidine, tryptophan, and 
initiator methionine transfer RNA species (21). The tRNA 
sequences are split after the anticodon-adjacent position 37, the 
normal location of tRNA introns (18). Terminal segments of 
these half-tRNAs constitute an intervening sequence that in- 
cludes a 12- to 14-nt, GC-rich RNA duplex formed between the 
y end of the 5' tRNA half and the 5' beginning of the 3' tRNA 
half (Fig. 6/4). The missing A^. equitans tRNAs reveal the 
necessity for assembly of two tRNA half-molecules. Randau et ah 
(22) proposed a model based on their discovery of extended 
reverse complementarity in the intervening sequences: an ex- 
tended GC-rich duplex in the split intron would facilitate 
base -pairing of the two halves. This stable duplex in the intron 
could facilitate folding of the whole tRNA body and promote the 
cloverleaf structure of the tRNA. 

Fig. 6 A shows a schematic representation of a 5' tRNA 
half-gene product (tRNA*^^^) and the corresponding 3' tRNA 
half-gene product formed in A^. equitans. The region between the 
folded tRNA and the intervening RNA duplex at the bottom 
resembles the BHB. Fig. 6A shows that the substrate derived by 
annealing the RNAs coded by the two tRNA^^^ split genes 
(pre-tRNA^i") has a 3' site bulge, a 4-bp helix (with one UG 
pair), and a 5' site in a four-base loop. Therefore, we are dealing 
with a BHL-like motif. Because this structure is located at the 
position where most archaeal tRNA introns occur, the fusion of 
the two halves might involve cleavage by the tRNA endonuclease 
followed by ligation. Here, we demonstrate that tRNA splicing 



endonucleases can cleave at the expected positions in pre- 
tRNA^^^^^ (Fig. 6A). 

We used a representative for each of the three forms of 
archaeal tRNA endonuclease: the homotetramer from METJA, 
the homodimer from ARCFU, and the heterotetramer from ^ 
SULSO. Fig. 65, lanes 3 and 4 shows that pre-tRNA^^^ is | 
correctly cleaved by the ARCFU and SULSO enzymes. The | 
products are 5' and 3' halves plus the two intron fragments. The S 
identity of the bands was verified by sequencing. The METJA S 
enzyme, on the contrary, requires a strict BHB structure, and it ll 
has problems with the recognition and cleavage of the 5' site 
(Fig. 6B, lane 2). We produced a minisubstrate, lacking the 
mature domain (5). Consistent with the fact that the archaeal 
enzymes operate independently of the matvire domain, the 
minisubstrate behaves exactly like the complete pre-tRNA^^" 
molecule (data not shown). 

Discussion 

The BHB, renamed hBHBh' by Marck and Grosjean (10), is a 
universal substrate (5). It is cleaved twice by all of the charac- 
terized tRNA splicing endonucleases, both archaeal and eu- 
karyal. This observation constitutes the main argument to 
postulate the existence of conserved features among the sites 
cleaved by the archaeal and the eukaryal enzymes. Certainly 
conserved is the distance between the active sites (16). 

The only available BHB structure, determined by NMR 
spectroscopy, is that of a 38 -nt RNA, derived from Haloferax 
volcanii pre-tRNA Trp (7). The conformation of the two 3-nt 
bulges is stabilized by stacking interactions between bulge 
nucleotides and bases in helices H, h, and h'. Both bulges appear 
on the same minor groove face of the 4-bp helix H. Not all of the 
archaeal intron-exon junctions can fold into a canonical BHB 
(hBHBh') structure. A relaxed hBH or HBh' motif, including the 
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constant central 4-bp helix H flanked by one helix (h or h') with 
at least 2 bp on each side, is often found (10). We can, therefore, 
conclude that only helix H and one of the two helices h or h' are 
strictly necessary for cleavage in certain Archaea. Because, 
presumably, the archaeal endonucleases do not contact the 
mature domain, hBH and HBh' do not appear different to the 
enzymes. For this reason we refer collectively to both of them as 
BHL-like motifs. 

Three different forms of tRNA endonuclease can be recog- 
nized in Archaea: a homotetramer in some Euryarchaea, a 
homodimer in other Euryarchaea, and a heteroteramer in the 
Crenoarchaea and Nanoarchaea. On the basis of the combina- 
tion of data derived from the study of the phylogenetic distri- 
bution of the motifs at the exon-intron junctions (10) and the 
endonuclease architectures (11) we were led to hypothesize that 
all three forms of the enzyme can cleave the canonical BHB and 
that the relaxed BHL-like motif can be cleaved only by the 
homodimeric {a-^ and the heterotetrameric (a2i32) forms. Our 
biochemical experiments were designed to explore this hypo- 
thetical evolutionary relationship. 

Only homodimers or heterotetramers can cleave the BHL-like 
structures (Figs. 4 and 5). The intron-exon junction motifs and 
the structures of the enzymes are, therefore, evolutionarily 
related. Although major questions regarding the origin of tRNA 
introns are still unanswered, we can speculate that if BHL-like 
motifs appeared as a consequence of events that modified the 
BHB motif, it would be necessary to have on hand forms of the 
enzyme capable of removing the intron correctly. Only those 
archaeal species that, after gene duplication, present an endo- 
nuclease that is either a homodimer or a heterotetramer could 
process the new substrates. This idea is supported by the fact that 
some Euryarcheota present a homodimeric endonuclease, but 
pre -tRNA genes with a BHL-like motif are not encoded in their 
genomes. It appears that the enzyme specificity for the BHB and 
BHL-like substrates is the result of adaptation of similar active 
sites, because the enzymes capable of processing the BHL-like 
structure are also capable of processing the BHB. This substrate 
ambiguity is a conspicuous feature that will be evolutionarily 
exploited in eukaryotic organisms (12, 19, 23). 

The intron excision reaction in Eukaryotes is characterized by 
exquisite dependence on the mature domain (24, 25). The hBH 
type motif resembles the motif found in most yeast pre-tRNAs 
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presenting introns at 37/38 (18). In this case the bulge is often 
>3 nt and a conserved base pair between a pyrimidine of the 5' 
exon (position 32 in tRNA) and a base in the single-stranded 
loop of the intron (position 3) is required for correct cleavage of 
the 3' splice site (26). The conserved base pair has been called 
the A-I pair, where A stands for anticodon and I for intron (27). 
We propose that the relaxed motifs and the consequent ambi- 
guity are a prelude, in the archaeal world, to the loss of autonomy 
of the BHB -type motifs and the advent of the domination by the 
mature domain. A common fold, stabilized by the two- 
conserved-residue signature, characterizes all archaeal endo- 
nucleases, despite the existence of the different enzyme forms. 
The archaeal common fold is not found in the eukaryal enzyme. 
Again, changes in substrate structure correspond to changes in 
enzyme structure, according to the paradigms of coevolution 
(12, 23). 

The BHB or the relaxed BHL structures can be formed both 

in cis and trans. We show that annealing the two tRNA split 
genes from A^. equitants produces a substrate for certain archaeal 
tRNA endonucleases. Because the archaeal tRNA endonuclease 
does not contact the mature domain of the pre -tRNA, but simply 
and directly binds to and cleaves the BHB or BHB-like struc- 
tures, we can expect that transsplicing mediated by the tRNA 
endonuclease is not restricted to tRNA. In fact, we recently 
reported that an archaeal endonuclease (from METJA) can 
catalyze nonspliceosomal mRNA splicing in mouse cells (28). 

Note. Kate Calvin, Michelle D. Hall, Fangmin Xu, Song Xue, and 
Hong Li (29), in agreement with our results, found that the splicing 
endonuclease from SULSO contains two different submits and accepts 
a broad range of substrates. 
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The relationship between enzyme architecture and substrate spec- 
ificity among archaeal pre-tRNA splicing endonucleases has been 
investigated more deeply, by using biochemical assays and model 
building. The enzyme from Archeoglobus fulgidus (AF) is particu- 
larly interesting: it cleaves the bulge- helix- bulge target without 
requiring the mature tRNA domain, but, when the target is a 
bulge- helix-loop, the mature domain is required. A model of AF 
based on its electrostatic potential shows three polar patches 
interacting with the pre-tRNA substrate. A simple deletion mutant 
of the AF endonuclease lacking two of the three polar patches no 
longer cleaves the bulge- helix-loop substrate with or without the 
mature domain. This single deletion shows a possible path for the 
evolution of eukaryal splicing endonucleases from the archaeal 
enzyme. 

molecular evolution | RNA-protein interactions | tRNA endonucleases 

Accuracy in tRNA splicing is essential for the formation of 
functional tRNAs and therefore for gene expression. In 
Bacteria, pre-tRNA introns are self -splicing group I introns, and 
the splicing mechanism is autocatalytic. In Eukarya, tRNA 

introns are small and invariably interrupt the anticodon loop one 
base 3' to the anticodon. In Archaea, the introns are also small 
and often reside in the same location as eukaryal tRNA introns, 
but not always (1, 2). In both Eukarya and Archaea, the 
specificity for recognition of the pre-tRNA resides in the endo- 
nucleases (3-5). These enzymes remove the intron by making 
two independent endonucleolytic cleavages. It is generally ac- 
cepted that the archaeal enzymes act without any reference to 
the mature domain but instead recognize specific structures that 
define the intron- exon boundaries (6-8). By contrast, the 
eukaryal enzyme normally acts in a mature -domain-dependent 
mode (3, 4). 

The exact way in which the eukaryal endonucleases recognize the 
precursors has yet to be determined, but many RNA-protein 
interactions are presumably required. Previous studies have shown 
the role of the three-dimensional structure and of specific invariant 
bases of the mature domain ia recognition and binding by the 
enzyme (3, 4). The importance of structural regularities in the 
precursors suggested an anchor-and-measure mechanism in which 
the endonuclease binds to one or more reference sites in the matiire 
domain, which are common to all pre-tRNAs, and measures the 
distance to the equivalently positioned intron-exon junctions. This 
hypothesis was supported by experiments that involved the engi- 
neering of changes in the distance that separate the mature domain 
from the splice sites. These alterations changed the size of the intron 
in a predictable way: the insertion of 1 bp in the anticodon stem 
increased the size of the intron by two bases, one at each end, and 
the insertion of 2 bp in the anticodon stem increased the size of the 
intron by 4 nt (3, 4). These striking results, which had been observed 
in specific instances, were generalized and taken to mean that there 
are no important recognition elements at the intron-exon bound- 
aries and that the enzyme binds only to the mature domain. Using 
a large number of constructs and in vitro selection techniques, we 
later found evidence that pre-tRNA introns also contribute to the 
specificity of splice site recognition (9, 10). There are also positions 



in the mature domain that participate in the splicing reaction, 
although they are not occupied by the same bases in all pre-tRNAs. 
We called these positions "cardinal positions" (CPs) (9). The 
eukaryal endonuclease is able to cut a precursor at the 3' site if a 
base in the pre-tRNA single-stranded loop of the intron is allowed 
to form an anticodon-intron (A-I) base pair with the base of the 5' 
exon situated at the position immediately following the anticodon 
stem (position 32 in yeast pre-tRNA^^®). A recently observed 
"A-minor" interaction between the first 3' exon nucleotide and the 
A-I base pair provides the structural basis for their interdependence 
(11). These observations suggested that important recognition 
elements also lie at the intron-exon boundaries. 

The mature domain dominates the scenario of eukaryal tRNA 
splicing. There is, however, a single striking exception: a specific 
structure consisting of two 3-nt bulges separated by a 4-bp helix, the 
so called bulge-helix-bulge (BHB) motif, has been shown to be 
recognized and cleaved in vitro by the eukaryal enzyme without any 
reference to the mature domain (12, 13). This finding presents 
interesting evolutionary implications. The BHB pre-tRNA is a 
universal substrate and is also cleaved by the archaeal enzymes. As 
a matter of fact, many intron-containing archaeal pre-tRNAs are 
characterized by a BHB constructed from the intron and from 
exonic sequences. The archaeal endonucleases, in general, remove 
the intron in a mature-domain -independent mode (14). It is because 
of this independence that the intron in archaeal pre-tRNAs can be 
located in different positions relative to the mature domain. Several 
archaeal pre-tRNAs contain relaxed forms of the BHB motif, one 
being the bulge-helix-loop (BHL), that consists of a single 3-nt 
bulge and an internal loop, separated by a 4-bp-helix (15). Some 
archaeal enzymes, such as the enzyme from Sulfolobus solfatarkus 
(SS), can cleave the BHL sul5strate in a mature-domain- 
independent way. We reasoned, however, that, because the relaxed 
forms like the BHL resemble the motif found in most yeast 
pre-tRNAs (presenting, as a rule, introns located one base 3' to the 
anticodon), they could represent a preliminary stage in the advent 
of the dominance of the tRNA splicing by the mature domain (15). 

At a certain stage, an enzyme could have evolved the capability 
of cleaving the BHL substrate only in mature-domain-dependent 
mode, while still capable of cleaving the BHB substrate in a 
mature-domain-independent mode. Such an enzyme would be 
characterized by surfaces providing specific interaction with the 
mature domain. In this article, we describe results of in vitro 
cleavage assays performed using a variety of substrate variants and 
the endonuclease from the archaeon Archeoglobus fulgidus (AF). 
This enzyme cleaves in a mature-domain-independent mode the 
BHB substrate but not the BHL substrate. In this respect, the AF 
enzyme resembles the eukaryal enzymes. An electrostatic potential 
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Fig. 1. Substrates for the in vitro cleavage assays. (A) PretRNA^^^ consists of two regions derived from yeast pre-tRNA^''^^ (nucleotides 1-31 and 38-76), joined 
by a 25-nt insert that corresponds to the BHB motif of archaeal tRNA'^''P. (5) Mini-BHB is the BHB motif of pre-tRNA^^^. (C) PretRNA^^^"- corresponds to pre-tRNA'i'y 
from Caenorhabditis elegans. The synthetic substrate presents a residue change at the 5' terminus (C to G) required for T7 transcription and a corresponding 
change (G to C) in the complementary strand. (D) Mlnl-BHL Is the BHL motif of pre-tRNA^*^"-. The base substitutions in the mini substrates were introduced to 
optimize T7 RNA polymerase transcription (5). 



calculation (16) applied to the AF enzyme structure permits the 
visualization, on the face of the protein that interacts with pre- 
tRNA, of positively charged patches, presumably involved in the 
interaction with specific parts of the substrate, including the mature 
domain. 

Results 

The AF Enzyme Cleaves the BHL Motif Substrate in a Mature-Domain- 
Dependent Mode. We reasoned that, if in the archaeal world tRNA 
endonucleases exist that interact with the mature domain following 
a strategy similar to that adopted by the eixkaryal enzymes, then in 
the corresponding organism there should exist pre-tRNAs present- 
ing the intron located only at the canonical position in the anticodon 
loop region between nucleotides 37 and 38. Interestingly, two 
independent bioinformatic analyses have reached the same con- 
clusion: the Euryarchaea encoding a homodimeric endonuclease 
present pre-tRNAs with the intron located exclusively at this 
canonical position (14). Do homodimeric enzymes, like the AF 
endonuclease, occupy a critical position in the evolutionary route 
from the archaeal to the eukaryal system? 

To investigate this possibility, we used two different pre-tRNA 
substrates for the in vitro cleavage assay. The first substrate pre- 
tRNA^™ presents a BHB motif whereas the second pre-tRNA^^^ 
presents a BHL (Fig. lA and C). Each substrate was incubated with 
three different pre-tRNA endonucleases from Archaea, represent- 
ing each of the three different architectures: the homotetramer (0:4) 
oiMethanocaldococcus jannaschit (MJ), the homodimer (oq) of AF, 
and the heterotetramer (012 J82) of SS. All of the enzymes cleaved 
the BHB substrate (Fig. 2C). On the contrary, there are differences 
in the processing of the BHL substrate. The homotetrameric 
enzyme from MJ did not cleave the BHL. The heterotetrameric 
enzyme from SS and the homodimeric enzyme from AF cleaved the 
BHL substrate, which is, therefore, stable at the assay conditions 
(65°C) (Fig. 2D). These findings confirm results previously obtained 
by us (15). It seems, therefore, that the enzyme specificity for the 
BHB and BHL substrates is the result of adaptation of similar active 
sites, because the enzymes capable of processing the BHL pre- 
tRNA are also capable of processing the BHB. The BHB is a 
universal substrate. The question we address next is whether the 
ability to cleave the relaxed motifs is a prelude, in the archaeal 
world, to the advent of the dominance by the mature domain in the 
Eiikarya. 



We asked first whether each of the three archaeal enzymes, 
representing the different architectures (17), can operate indepen- 
dently of the mature domain on minisubstrates that correspond 
respectively to the BHB (mini-BHB) (Fig. IB) or to the BHL 
(mini-BHL) (Fig. ID) and lack the mature domain. Fig. 2A shows 
that the intron was correctly removed from the mini-BHB by all 
three enzymes whereas the mini-BHL was effectively cleaved only 
by the heterotetrameric SS endonuclease (Fig. IB). The active sites 
of the enzyme from SS therefore allow the BHL to perform 
autonomously. We expected that the homotetrameric enzyme from 
MJ would not cleave this substrate. But to our surprise, the 
homodimeric AF enzyme also failed to cleave the mini-BHL (Fig. 
2B), Inefficient cleavage of the 3' site could occasionally be 
observed (see Fig. SB). This striking observation is in contrast to our 
observation for the full-length pre-tRNA^ ™^ (Fig. 2D). These 
results suggest that the homodimeric enzyme of AF, like the 
eukaryal enzymes, requires the mature domain for cleavage. These 
observations led us to attempt to visualize a footprint of the 
full-length pre-tRNA, on the surface of the AF enzyme, a footprint 
of the full-length pre-tRNA. 

The AF Enzyme Presents RNA Binding Sites Involved in Specific 
Interactions with Different tRNA Domains. We first calculated an 
electrostatic potential using the MJ endonuclease structure deter- 
mined by x-ray diffraction (18). This enzyme presents a face 
interacting with tRNA that is uniformly positively charged (Fig. 
3/4). Instead, the AF enzyme surface charge, calciilated in the same 
way, omitting the RNA substrate, clearly shows three distinct 
positively charged polar patches (11) (Fig. 3B). The central patch is 
formed by residues belonging to the two catalytic repeats; it includes 
the two catalytic sites and is dii*ectly involved in the interaction with 
the BHB. The two flanking patches are identical and symmetrically 
placed with respect to one another. These two peripheral patches 
are formed by 11 basic residues located in the N-terminal domain 
of the AF protein. Four belong to helix 1 (Lys-12, Lys-14, Arg-18, 
and Arg-19); two belong to to helix 4 (Arg-74 and Arg-76); and two 
are on loop 7 (Lys-90 and Lys-91) (11). 

To confirm that we are dealing with three discrete patches and 
not a continuous elongated one, we considered the fact that each 
subunit of the homodimeric AF enzyme contains two similar 
repeating domains that are homologous to the subunit structure of 
the homotetrameric enzyme from MJ: the C-terminal repeat 
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Fig. 2. /n Wtro cleavage of BHB- and BHL-containing substrates. {A) The mini-BHB substrate was incubated witii three different enzymes. (B) The mini-BHL 
substrate was incubated with three different enzymes. (O Pre-tRNA^'^^ was incubated with three different enzymes. (D) PretRNA^""- was incubated with three 
different enzymes. The conditions of the reactions have been reported (15). The cleavage products were analyzed by electrophoresis on 10% polyacrylamide 
gel containing 29:1 monomer to bis and 8 M urea, followed by autoradiography. The identification of the reaction products is indicated. Lane 1 contains the 
control (C, no enzyme added). Lanes 2-4 show the products after incubation with the endonucleases from MJ, AF, and SS, respectively. The 2/3 molecules are 
produced by single cleavage. 



(CAF) and the N-terminal repeat (NAF). We designed a mutant 
enzyme that comprises a deletion of the NAF domain with the 
exception of the last two beta strands that comprise loop 10 at the 
C terminus, known to be important for the assembly of the enzyme 
(Fig. 44). According to our expectations, the mutant enzyme 
(AAF) lacks the two peripheral patches, but it should still present 
all of the residues directly interacting with the BHB (Fig. 45). To 




Fig. 3. Electrostatic surface potentials of MJ {A) and AF {B) endonucleases. 
Positively charged regions are shown in blue, the neutral regions are shown in 
white, and the negatively charged regions are shown in red. The green circle 
encompasses the catalytic site, whereas the cyan circles enclose the two 
putative sites recognizing the tRNA mature domain. The figure was generated 
with PyMOL (http://pymol.sourceforge.net), and the electrostatic properties 
were generated by using the APBS (Adaptive Poisson-Boltzman Solver) soft- 
ware package (16). 



determine whether this truncated enzyme can assemble into an 
active form, we overexpressed and purified it as described in 
Materials and Methods. Fig. 5 A and C shows that the truncated 
enzyme is capable of cleaving the BHB in the presence or absence 
of a tRNA mature domain, but it is no longer capable of cleaving 
the BHL substrate, even in the presence of the mature domain (Fig. 
5 B and C). We conclude that the positively charged peripheral 
patches interact with a set of recognition elements that are pre- 
sented by the mature domain of AF tRNA precursors. 




Fig. 4. Structure of AAF enzyme and model of the interaction with its 
substrate. (>A) Cartoon representation of the parts deleted from AF endonu- 
clease (gray). The arrows indicate the new N termini of the subunits. The BHB 
substrate is shown in green. (B) Electrostatic surface potential of AAF. Posi- 
tively charged regions are shown in blue, the neutral regions are shown in 
white, and the negatively charged regions are shown in red. The pre-tRNA 
model is shown in cyan. 
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Fig. 5. /n vitro cleavage of BHB and BHL containing substrates by the AAF enzyme. (A) Jhe mini-BHB substrate was incubated witin the AF and AAF enzymes. 
(B) The mini-BHL substrate was incubated with the AF and AAF enzymes. (O The pre-tRNA^^^ substrate was incubated with the AF and AAF enzymes. (D) The 
pre-tRNA^'^'- substrate was incubated with the AF and AAF enzymes. The conditions of the reactions have been reported (12). The cleavage products were 
analyzed by electrophoresis on 10% polyacrylamide gel containing 29:1 monomer to bis and 8 M urea, followed by autoradiography. The identification of the 
reaction products is indicated. Lane 1 contains the control (C, no enzyme added). Lanes 2 and 3 show the products after incubation with the endonucleases from 
AF and AAF, respectively. The 2/3 molecules are produced by single cleavage. 



Discussion 

In Archaea, there are three distinct architectures of tRNA 
endonuclease, the enzyme responsible for the excision of the 
intron from pre-tRNA. The crystal structures of the enzymes 
from two Euryarcheota, AF and MJ, are known (18, 19). The AF 
enzyme is a homodimer (0:2). Each subunit comprises two similar 
repeats, but the same repeat performs a different function in 
each of the two subunits. The NAF acts to stabilize the dimer, 
whereas the CAF is the catalytic domain. The MJ enzyme, a 
homotetramer (0:4), is characterized by a similar functional 
subdivision. Two of the subunits play a catalytic role, whereas the 
other two have a structural role. We found that the Crenarcheote 
SS presents two genes homologous to the one coding for the MJ 
protein. One of the two genes is specialized for encoding the 
catalytic subunit and the other for encoding the subunit that 
maintains the structure of the heterotetramer (a2(32)- Presum- 
ably, at the origin, in Archaea the situation was similar to that in 
MJ, with a single gene coding for a single protein able to serve 
both as catalytic or structural subunit. During evolution in AF 
and in SS, the single gene duplicated. In AF the two duplicated 
genes fused and subfunctionalization occurred between the two 
halves of the fused protein: the C-terminal half (CAF) retained 
the catalytic function and the N-terminal half (NAF) specialized 
as the structural subunit. In SS, one of the duplicated genes has 
specialized for encoding the catalytic and the other for encoding 
the structural subunit. Both subunits are necessary for substrate 
cleavage by the heterotetramer (15, 17, 20, 21). 

The protagonist of this paper is the AF enzyme. The homodimer 
cleaves pre-tRNA^™, pre-tRNA^^^, and mini-BHB, but not mini- 
BHL. The AF enzyme, therefore, can cleave the BHL only in a 
mature-domain-dependent mode. Because mature-domain depen- 
dence for cleavage of non-BHB substrates is a main feature of the 
eukaryotic enzymes, we observe in AF the dawn of dominance of 
the mature domain in tRNA splicing. In AF and in all other 
Euryarchaea encoding homodimeric endonucleases, as in Eukarya, 



the intron is located at the canonical position in the anticodon loop 
region between nucleotides 37 and 38. The latter feature is a 
necessary requirement for mature-domain dependence, because it 
assures a fixed geometry relative to the mature domain for all 
pre-tRNAs. Three RNA-binding sites, one central and two periph- 
eral, characterize the surface of the AF enzyme that interacts with 
the substrate (Fig. 3B). The central site contains the two symmetric 
catalytic triads and presumably interacts with the BHB and its 
relaxed forms, like the BHL. The two flanking patches are identical 
and symmetrically placed and are presixmably involved in the 
interaction with the mature domain. They function one at a time, 
because the enzyme can accommodate only one tRNA molecule. 
In an experiment that attempted to reverse the proposed evolu- 
tionary process, we deleted a portion of the N-terminal domain of 
the AF enzyme; the resulting ziAF mutant should lack the two 
peripheral RNA binding regions (Fig. 4 A and B). As predicted, 
AAF is unable to perform in a mature-domain-dependent mode. It 
can cleave only pre-tRNA^™ and mini-BHB; it does not cleave 
pre-tRNA^^^ or, of course, the mini-BHL (Fig. 5). 

Interaction of a Pre-tRNA""^ with the AF Enzyme. We built a model 
using the recently solved structure of the AF enzyme in complex 
with a minimal BHB substrate, generously provided by H. Li (11). 
Interestingly, the asymmetric unit of the co-crystal contains two 
molecules of RNA-enzyme complex and the analysis of the crystal 
packing made it possible to see that each enzyme, besides inter- 
acting with its substrate through the BHB structure, also interacts 
with the A-helix belonging to the substrate of a symmetry-related 
enzyme molecule (Fig. 6A). These interactions involve the side 
chains of Lys-90 and Lys-91, which belong to the group of residues 
forming the above described peripheral positive patch (Fig. 3B). 
Almost certainly, the observed RNA-protein contacts are the result 
of different packing constraints in the crystal lattice and only mimic 
the interaction with a cognate mature-domain substrate in solution, 
but they clearly suggest that a site can be available to host the tRNA 
mature body as we propose in our model. Because of the symmetry 
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Fig. 6. Model of the interactions between the AF enzyme and the substrates. 
Electrostatic surface potential of AF bound either to two BHB RNA molecules 
(shown in green) obtained from the crystallographic structure (1 1) (A) or to a 
model of pre-tRNA (shown in cyan) (fi). 

of the catalytic site, the enzyme can accommodate one tRNA 
molecule at a time, interacting through one or another patch. We 
superimposed, usiag the program O, the 6-bp stem of the anticodon 

arm of the tRNA^^^ to the corresponding 6-bp stem of the BHB 
minimal substrate. The superimposed pre -tRNA mature domain 
faces one of the peripheral patches of the enzyme with the variable 
pocket and T-arm-acceptor stem helix (Fig. 6B). The enzyme can 



approach the tRNA only by using this strategy; iatrons are also 
present in class II tRNAs (tRNA^^ in AF). These tRNAs present 
a long extra arm that could otherwise impede, by steric hindrance, 
interaction with an enzyme approaching from the opposite side. 
Moreover, the T-arm-acceptor coaxial A-helix presents its deep and 
narrow major groove to the enzyme so that the latter can iateract 
only with the phosphate backbone, thereby permitting recognition 
of the tRNA substrate that is not sequence specific. 

From Archaea to Eukarya. The saKent features that characterize the 
AF enzyme are the result of subfunctionalization and creation of 

the binding sites for elements of the mature domain of the pre- 
tRNA. As far as subfunctionalization is concerned, presumably, the 
extreme environments that characterize the habitat of Archaea is 
not ideal for 04 architectures, where a single polypeptide plays two 
very different roles. It is not clear whether drift alone or positive 
selection or both had a role in subfunctionalization. 

We should not forget, however, that subfunctionalization did not 
occur only in homodimeric (ap) enzymes, like AF, but also in 
heterotetrameric (a:2i82) enzymes like SS. The fact, therefore, that 
it occurred twice independently, and by different routes in the same 
gene, argues in favor of positive selection. The acquisition of the 
three RNA-binding sites points to the extraordinary and spectac- 
ular series of events that saw evolution engraving on the surface of 
the AF enzyme the overlapping portraits of two pre-tRNAs. The 
AAF mutant tells us that we can reverse the evolutionary process 
by deleting a sequence of amino acids from the N-terminal part of 
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Fig. 7. Model of a chimeric enzyme presenting two CAFs, one NAF, and tlie human Sen15. Representation of the model chimeric enzyme. The archaeal parts 
are shown in gray, and the human part is shown in red. (B) Electrostatic surface potential of the model chimeric enzyme. Positively charged regions are shown 
in blue, neutral regions are shown in white, and negatively charged regions are shown in red. (C) Primary sequences and secondary structures of the NAF and 
of the human Sen15. Secondary structure elements, as determined by the crystallographic structure of AF (19) and the NMR structure of Sen15 (24), are shown 
in black and red, respectively. The "s" and J3 indicate p-strands, and "h"and a indicate a-helices. Black columns indicate residues that are conserved; gray columns 
indicate residues that are similar. 
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the NAF. The truncated enzyme loses the abiHty to act in a 
matiire-domain-dependent mode. If we compare Fig. 4B with Fig. 
6B, we see that AAF completely lacks the peripheral sites that 
interact, in the wild-type enzyme, with the matiire domain. 

In the intact AF enzyme, the two active sites are completely 
interchangeable. Depending on which peripheral binding site is 
used by the pre-tRNA substrate, a given active site will cleave at the 
5' or at the 3' junction. In the case of the yeast tRNA splicing 
endonuclease, the situation is radically different. The eukaryal 
enzyme is a heterotetramer (a, j8, y, 8) (22). The four subunits 
(Senl5, Sen2, Sen34, and Sen54 in yeast) contribute to the forma- 
tion of two composite active sites that, because of specific interac- 
tions with the mature domain, interact rigidly and specifically either 
with the 5' or the 3' cleavage sites (23). K we wanted, therefore, to 
modify the AF enzyme to make it more similar to its eukaryal 
counterpart, we should think of ways to eliminate one of the two 
peripheral patches. In this way, the active sites would be rigidly 
assigned to one or the other of the two cleavage sites. 

The AAF result teaches us that the peripheral sites comprise a 
sequence of amino acids located in the N-terminal part of the NAF. 
We observed that the sequence of the eukaryal subunit Senl5 does 
not contain the catalytic triad (22) and that therefore it must be a 
structiiral subixnit. Alignment of the Senl5 and NAF sequences 
shows that the eukaryal peptide lacks a segment corresponding to 
the N-terminal part of NAF, which constitute the peripheral site, 
but is homologous to the C-terminal sequence of the AF enzyme 
(Fig. 7C). Because the NMR structure of the human Senl5 (24) is 
available, we aligned the atomic structures of the Senl5 monomer 
with NAF using the program VMD (25). The alignment clearly 
shows that SenlS and the C-terminal portion of NAF share a 
common fold (Fig. 7A). On the basis of these observations, we 
constructed a model of a chimeric enzyme comprising two CAFs, 
one NAF, and the human SenlS substituting for the other NAF. 
Electrostatic potential calculations show that this model chimeric 
enzyme has lost one of the peripheral patches and, like a eukaryal 
enzyme, is characterized by active sites rigidly assigned to either the 
5' or the 3' cleavage sites of the substrate (Fig. IB). 

We expect that, if the route that we hypothesize was really 
followed in evolution, the active sites would drift, obviously con- 
serving the catalytic triad and the ability to cleave the BHB 
substrate, adapting to the specific intron-exon boundaries. Clearly, 
much remains to be done in terms of defining the parts of the 
mature domain that are required for correct positioning of the 
pre-tRNA on the enzyme surface. What we have established is that 
at least one current archeon, AF, retains the memory of the first 
step on the path to the eukaryal system. 

Materials and Methods 

Expression and Purification of the Protein Constructs. The construc- 
tion of the vectors coding for the genes of the endonucleases from 



AF, SS, and MJ and the purification of the enzymes are described 
elsewhere (15, 17). The gene coding for the AAF construct was 
obtain by PCR-amplification by using as a template the previously 
cloned AF gene and the following DNA primers: (/) 5'GGAAT- 
TCCATATGGACGAGTTAAGGCTTGCTGTCG and (//) 
5 ' CGCGGATCCTCA A ACCTTA ACCCTCTCA A AGC. The 
two primers were designed to obtain an amplified fragment pre- 
senting an Ndel site upstream of the gene and a BamHI site 
downstream. After digest, the fragment was cloned into pET28b 
(Novagen, Madison, WI). The correct clones were verified by DNA 
sequencing. The protein was overexpressed as hexa-histidine- 
tagged forms (pET28b) in Escherichia coli Rosetta (Novagen). 
Cells were grown in 1 -liter cultures of Terrific Broth broth at 37°C 
in the presence of 30 /xg/ml kanamycin (pET28) with the addition 
of 30 /xg/ml chloramphenicol. The purification consisted of a metal 
affinity colunm as a first step, followed by gel filtration using an 
analogous strategy already described for the other enzymes (17). 
The purity of the enzyme was assessed by Coomassie blue staining 
of SDS polyacrylamide gels. 

In Vitro RNA Synthesis and Pre-tRNA Splicing Reactions. DNA tem- 
plates were produced as described (17). T7 RNA polymerase 
transcription reactions were carried out following the conditions of 
tiie Ambion (Austin, TX) T7-Megashortscript kit. [a-^^F]lJTF (800 
Ci/mmol; Amersham Pharmacia) (1 Ci = 37 GBq) was included to 
radiolabel the RNA transcripts. All reactions were purified by 
electrophoresis on a 10% denaturing polyacrylamide gel. RNA 
products corresponding to the correct size were eluted, phenol was 
extracted, and ethanol was precipitated. 

tRNA splicing reactions were typically performed in 25 mM 
Tris-HCl (pH 7.5), 5 mM MgCls, 100 mM NaCl, and 10% glycerol, 
and included 20 fmol of tRNA precursor substrates. Purified 
splicing endonucleases were added and incubated at 65''C for 1 h. 
The reactions were stopped by phenol extraction, and ethanol was 
precipitated and separated on 10% denaturing polyacrylamide gels. 
Cleavage products were visualized by analysis on a Molecular 
Dynamics model Storm 860 Phosphorlmager by using ImageQuant 
software, version 4. 
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